Sequence and protein comparison



BB Rat CCGCCTCCTCGCCCGCCCTCCCGCAGCCCAGGAGCCGAGGCTGCCGCGGCCGTGGCGGCG 
SHR Rat CCGCCTCCTCGCCCGCCCTCCCGCAGCCCAGGAGCCG AGG C TGCC GCGGCCGTGGCGGCG 
HOMO SAPIENS GASATT t CGgcgC )GAGG GCGGCCGTGGCGGCG 

GAGCCCTCAGCC ATG GCC TCG GGC GAC ACC CTC TAC ATT GCC ACG GAC GGC TCG GAG ATG 
GAGCCCTCAGCC ATG GCC TCG GGC GAC ACC CTC TAC ATT GCC ACG GAC GGC TCG GAG ATG 
GAGCCCTCAGCC ATG GCC TCG GGC GAC ACC CTC TAC ATc GCC ACG GAC GGC TCG GAG ATG 

MASGDT LYI ATDGSEM 
MASGDT LYI ATDGSEM 
MASGDT LYI ATDGSEM 

CCA GCC GAG ATC GTG GAA CTG CAT GAG ATT GAG GTG GAG ACC ATC CCG GTG GAG ACT ATC 
CCA GCC GAG ATC GTG GAA CTG CAT GAG ATT GAG GTG GAG ACC ATC CCG GTG GAG ACT ATC 

CC§ GCC GAG ATC GTG GA§j CTG CAT GAG AT§ GAG GTG GAG ACC ATC CCG GTG GAG AC§ ATC 
Figure 9-1. Continued



GGC GAC GAC TCG GAC GGG CTG CGC GCC GAG GAC GGG TTC GAG GAC CAG ATC CTC ATT CCG 
GGC GAC GAC TCG GAC GGG CTG CGC GCC GAG GAC GGG TTC GAG GAC CAG ATC CTC ATT CCG GG| 
GAC GAC TCG GAC GGG CTG CGC GCC GAG GAC GGC TTC GAG GAC gAG ATC CTC ATe CCG 
Figure 9-2. Continued
AGC TCA AAG CTA AAA CGA CAC CAG CTG GTT CAT ACT GGA GAA AAG CCC TTT CAG-GTAGAGC
CAGTTCCTGTTCCCCAAACTGCAAGCTAGGGTGCTGGTCAGGGTGGTTGATATCAAGCACTATGGGGCACCGGT
CAGTTCCTGTTCCCCAAACTGCAAGCTAGGGTGCTGGTCAGGGTGGTTGATATCAAGCACTATGGGGCACCGGT 



TGGGGTATTTTATTCCCATCCCTCCTGTCTGCTTGGGTTCCTGGTTACTGCTCGGGACTGCAGGTGTTACAGAT 
TGGGGTATTTTATTCCCATCCCTCCTGTCTGCTTGGGTTCCTGGTTACTGCTCGGGACTGCAGGTGTTACAGAT 



GGGGGTGGAGGGATTATGCGAAGCACCCCCACACTAAATTTCTAGCAGGTTTACAAAAACTCAACAGTTTTGTTT 
GGGGGTGGAGGGATTATGCGAAGCACCCCCACACTAAATTTCTAGCAGGTTTACAAAAACTCAACAGTTTTGTTT 



TGTAGTGAGTAGTGTGTTGAATTACTGATAGAGTGCTTATAAGTGCTGTTGGCTACAGCTCCAGGTGACACTTG 
TGTAGTGAGTAGTGTGTTGAATTACTGATAGAGTGCTTATAAGTGCTGTTGGCTACAGCTCCAGGTGACACTTG 



Figure 9-3. Continued



GTGCTGCTTATAGAAGAClCGTGAGTTGACAGTTGGCATCACTAAATATCTTAATCATCTGTAGTCTACTTCCT 
GTGCTGCTTATAGAAGAcfcGTGAGTTGACAGTTGGCATCACTAAATATCTTAATCATCTGTAGTCTACTTCCT 



AGAGTGTCTCTGAAAACACTCAAGCTGTAAATTTGCACTCAGCACAGCCCTTCTGTTTCTCAAGAACTAGCCAT 
AGAGTGTCTCTGAAAACACTCAAGCTGTAAATTTGCACTCAGCACAGCCCTTCTGTTTCTCAAGAACTAGCCAT 
TTC GAA GGC TGC GGG AAG CGC TTT TCA CTG GAC TTC AAT TTG CGC ACG CAT GTG CGA ATC
TTC GAA GGC TGC GGG AAG CGC TTT TCA CTG GAC TTC AAT TTG CGC ACf CAT GTG gGA ATC 
CTTTGTATATTATTTCTAGGAAGAATTTTAAAAATGAATCCTACACACTTAAGGGACATG
C
ttttgataaagtagtaaaaatttaaaa7vaatactttaataagatgacattgctaagatgc
ttttgataaagtagtaaaaatttaaaaaaatactttaataagatgacattgctaagatgc 
ttttgataaagtagtaaaaatttaaaaaa-tactttaataagatgacattgctaagatgc 

tctatcttgctctgt/^atctcgtttcaaaaacaaggtgtttttgtaaagtgtggccccaa 
tctatcttgctctgtaatctcgtttcaaaaacaaggtgtttttgtaaagtgtggtcccaa 
tStatcttgctctgtaatctcgtttcaaaaacaaggtgtttttgtaaagtgtggtcccaa 

caggaggacaattcatgaacttcgcatcaaaagacaattctttatacaacagtgctaaaa 
caggaggacaattcatgaacttcgcatcaaaagacaattctttatacaacagtgctaaaa 
caggaggacaattcatgaacttcgcatcaaaagacaattctttatact^acagtgcttvaaa 



ATG ATG ATG 



Figure 10 



Protein comparison 

BB

SHR

Homo sapiens 

MASGDTLYIATDGSEMPAEIVELHEIEVETIPVETIETTVVGEEE^DEDDEDGGGGDHGGGGGjERGHAGHHHHHHHHHH 
MASGDTLYIATDGSEMPAEIVELHEIEVETIPVETIETTVVGEEEpDDEDDEDGGGGDHGGGGGH-jGHAGHHHHHHHHHH 
MASGDTLYIATDGSEMPAEIVELHEIEVETIPVETIETTVVGEEE^Dl^DDEDGGGGDHGGGGGgHGHAGHHHHHHHHHH 

P^PPMIALQPLVTDDPTQVHHHQEVILVQTREEVVGGDDSDGLRAEDGFEDQILIPVPAPAGGDDDYIEQTLVTVAAAGK 
|~jPPMIALQPLVTDDPTQVHHHQEVILVQTREEVVGGDDSDGLRAEDGFEDQILIPVPAPAGGDDDYIEQTLVTVAAAGK 
HBPPMIALQPLVTDDPTQVHHHQEVILVQTREEVVGGDDSDGLRAEDGFEDQILIPVPAPAGGDDDYIEQTLVTVAAAGK 

SGGG^SSGGGRVKKGGGKKSGKKSYLG^GAGAAGGGGADPGNKKWEQKQVQIKTLEGEpjSVTMWSSDEKKDIDHETVVEE 
SGGGJs , SSGGGRVKKGGGKKSGKKSYLG^GAGAAGGGGADPGNKKWEQKQVQIKTLEGE|E|SVTMWSSDEKKDIDHETVVEE 
SGGGySSGGGRVKKGGGKKSGKKSYLGjGGAGAAGGGGADPGNKKWEQKQVQIKTLEGEbsVTMWSSDEKKDIDHETVVEE 

QIIGENSPPDYSEYMTGKKLPPGGIPGIDLSDPKQLAEFARMKPRKIKEDDAPRTIA 
QIIGENSPPDYSEYMTGKKLPPGGIPGIDLSDPKQLAEFARMKPRKIKEDDAPRTIA 
QIIGENSPPDYSEYMTGKKLPPGGIPGIDLSDPKQLAEFARMKPRKIKEDDAPRTIA 

*Zinc finger 

cphkgctkmfrdnsamSkhlhthgprvhvcaecgkafvessklkrhqlvhtgekpfqctfegcgkrfsldfnlrthvSih 
cphkgctkyfrdnsamkkhlhthgprvhvcaecgkafvessklkrhqlvhtgekpfqctfegcgkrfsldfnlrthvfelh 
cphkgctkyfrdnsamrkhlhthgprvhvcaecgkafvessklkrhqlvhtgekpfqctfegcgkrfsldfnlrthvglh 

Zinc finger* 
TGDRPYVCPFDGCNKKFAQSTNLKSHILTH 
TGDRPYVCPFDGCNKKFAQSTNLKSHILTH 
TGDRPYVCPFDGCNKKFAQSTNLKSHILTH 

AKAKNNQ* 
AKAKNNQ* 
AKAKNNQ* 



n=411/414 
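The protein rows in Figure 10 can be cross-checked against the codon rows in Figure 9 by translating the coding sequence. The following is a minimal sketch, assuming the standard genetic code; the names `CODON_TABLE` and `translate` are illustrative and do not come from the source.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence to one-letter amino acids ('*' = stop).

    Spaces are ignored, so codon-grouped text can be pasted directly.
    Trailing bases that do not fill a codon are dropped; characters other
    than A, C, G, T raise KeyError.
    """
    cds = cds.upper().replace(" ", "")
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First 16 codons of the rat coding sequence as printed in Figure 9.
print(translate("ATG GCC TCG GGC GAC ACC CTC TAC ATT GCC ACG GAC GGC TCG GAG ATG"))
```

Applied to the first 16 codons, this reproduces the `MASGDTLYIATDGSEM` row shown under the alignment.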
YY1 primers



Designation         Position    Primer sequences

K823-F (Promoter)               CAC AGG CGT TTC TCG TCA GAG
K825-R (Promoter)               AAT ACC AAC TCC TCA ACC CCG A
K884-F              -104        CTT CCT CCC TCT GCC TTC CTT
K801-F              55-75       GAG ATC GTG GAA CTG CAT GAG
K827-R              127-150     GTC TTC GTC GTC GTC CTC CTC CTC
K814-F              417-437     CGG AGA CGA CGA CTA CAT CGA
K806-R              428-452     GTG ACC AGC GTC TGC TCG ATG TAG T
K804-R              529-550     CCA GGT AAC TCT TCT TGC CGC
K805-R              589-610     GTT CCC ACT TCT TAT TAC CCG G
K828-F              627-648     CAA GAC CCT GGA GGG CGA GTT C
K830-F              697-721     ACA GTG GTT GAA GAG CAG ATC ATT G
K829-R              690-722     CCA ATG ATC TGC TCT TCA ACC AC
K831-F              839-866     GCC AAG AAA AAT TAA AGA AGA TGA TGC
K832-R              856-881     GCT ATT GTT CTT GGA GCA TCA TCT TC
K815-F              997-1021    GAG AGC TCA AAG CTA AAA CGA CAC C
K833-R              1026-1050   AAA GGG CTT TTC TCC AGT ATG AAC C
K817-R              1077-1099   AAT TGA AGT CCA GTG AAA AGG GC
K816-F              1105-1126   ACG CAT GTG CGA ATC CAT ACC G
K870-R              1346-1372   CAA AAC ATG TCC CTT AAG TGT GTA GGA
K818-R              1501-1528   AAT TGT AAG CAA CAG GTG AGC TTC ATG
K821-F (Intron 3)               GCG AAG CAC CCC CAC ACT AAA TTT C
K874-F (Intron 3)               GCT TAT AAG TGC TGT TGG CTA CAG CT
K875-R (Intron 3)               GTC ACC TGG AGC TGT AGC CAA C



F1     10  1  TCACTGGACTTCAATTTGCGC  (1084)
F2     12  1  TTTTCACTGGACTTCAATTTGCG  (1081)
F3     12  1  ACCAGATCCTCATTCCGGTACC  (383)
F4     14  1  CCCTTTCAGTGCACATTCGAA  (1045)
F5     15  1  GACGACGAAGACGACGAGGAT  (139)
F6     17  1  GAGAGCTCAAAGCTAAAACGACACC  (997)
F7     19  1  GGAGACGACGACTACATCGAGC  (418)
F8     22  1  CGGAGACGACGACTACATCGA  (417)
F9     23  1  TGAGAGCTCAAAGCTAAAACGACAC  (996)
F10    24  1  GAGGACCAGATCCTCATTCCG  (379)
F11    26  1  AACTCCCTCCTGGAGGGATACC  (767)
F12    27  1  GAGACGACGACTACATCGAGCAG  (419)
F13    29  2  GAGGAGGACGACGACGAAGAC  (130)
F14    30  1  TTGAGAGCTCAAAGCTAAAACGACA  (995)
F15    30  1  ACCCTCTACATTGCCACGGAC  (16)
F16    30  1  ACTACATCGAGCAGACGCTGGT  (428)
F17    34  1  GAGCTCAAAGCTAAAACGACACCA  (999)
F18    35  1  TTCAGTGCACATTCGAAGGCT  (1049)
F19    36  2  TGGAGACTATCGAGACCACGGT  (98)
F20    37  1  TTTCAGTGCACATTCGAAGGC  (1048)
F21    39  1  GTGCGAATCCATACCGGAGAC  (1111)
F22    41  1  GAGGTGATTCTGGTGCAGACG  (301)
F23    41  1  ACTCCCTCCTGGAGGGATACCT  (768)
F24    41  2  GTGGAGACTATCGAGACCACGG  (97)
F25    41  1  AGAGGTGATTCTGGTGCAGACG  (300)
F26    42  1  TGAAATCTCACATCTTAACACACGCT  (1190)
F27    43  2  TACATCGAGCAGACGCTGGTC  (430)
F28    43  1  ACGACTACATCGAGCAGACGCT  (425)
F29    43  1  GAAACTCCCTCCTGGAGGGATAC  (765)
F30    49  1  CTGCACAAAGATGTTCAGGGATAAC  (897)
F31    52  1  AAAACGACACCAGCTGGTTCATAC  (1011)
F32    52  1  TAAAACGACACCAGCTGGTTCATAC  (1010)
F33    53  1  AGAAGAGCGGCAAGAAGAGTTACC  (524)
F34    53  1  ACCTGAAATCTCACATCTTAACACACG  (1187)
F35    53  1  CCTGAAATCTCACATCTTAACACACG  (1188)
F36    55  1  GACACCAGCTGGTTCATACTGGA  (1016)
F37    55  2  GGTGGAGACTATCGAGACCACG  (96)
F38    55  1  AGACGACGACTACATCGAGCAGA  (420)
F39    57  1  CAGTGGTTGAAGAGCAGATCATTG  (698)
F40    57  1  ACAGTGGTTGAAGAGCAGATCATTG  (697)
F41    57  1  GGTTGAAGAGCAGATCATTGGG  (702)
F42    58  1  GGTCCCAGAGTCCACGTCTGT  (952)
F43    59  1  TGCACAAAGATGTTCAGGGATAACT  (898)
F44    60  1  GATGCTCCAAGAACAATAGCTTGC  (862)
F45    62  1  GTCCCAGAGTCCACGTCTGTG  (953)
F46    67  1  GCTTTTCACTGGACTTCAATTTGC  (1079)
F47    67  1  AGTGGTTGAAGAGCAGATCATTGG  (699)
F48    67  1  GTGGTTGAAGAGCAGATCATTGG  (700)
F49    71  1  AGAGCGGCAAGAAGAGTTACCTG  (527)
F50    71  1  TCACATCTTAACACACGCTAAAGCC  (1197)
F51    72  1  ATCTCACATCTTAACACACGCTAAAGC  (1194)
F52    73  1  CTGAAATCTCACATCTTAACACACGC  (1189)
F53    74  1  ACGACACCAGCTGGTTCATACTG  (1014)
F54    76  1  AGATATTGACCATGAAACAGTGGTTGA  (681)
F55    76  1  GATATTGACCATGAAACAGTGGTTGA  (682)
F56    76  1  GAGGGATACCTGGCATTGACCT  (779)
F57    77  1  AGACCATCCCGGTGGAGACTAT  (86)
F58    78  1  GAAGAGCGGCAAGAAGAGTTACCT  (525)
F59    78  3  GGAGACTATCGAGACCACGGTG  (99)
F60    82  1  GGTTCGAGGACCAGATCCTCA  (374)
F61    83  2  GAGCAGATCATTGGGGAGAACTC  (709)
F62    87  1  GAAGATGATGCTCCAAGAACAATAGC  (856)
F63    88  1  CGCTAAAGCCAAAAACAACCAGT  (1212)
F64    88  1  ATACCGGAGACAGGCCCTATGT  (1121)
F65    89  1  CAATAGCTTGCCCTCATAAAGGC  (875)
F66    89  1  AAGATATTGACCATGAAACAGTGGTTG  (680)
F67    89  1  ACAATAGCTTGCCCTCATAAAGGC  (874)
F68    92  1  AGAAAAGCCCTTTCAGTGCACA  (1038)
F69    95  1  ATATTGACCATGAAACAGTGGTTGAAG  (683)
F70    95  1  GCGGCAAGAAGAGTTACCTGG  (530)
F71    95  1  TATTGACCATGAAACAGTGGTTGAAG  (684)
F72    95  1  ATTGACCATGAAACAGTGGTTGAAG  (685)
F73    95  1  TTGACCATGAAACAGTGGTTGAAG  (686)
F74    96  1  GAACAATAGCTTGCCCTCATAAAGG  (872)
F75    96  1  AGAACAATAGCTTGCCCTCATAAAGG  (871)
F76    98  1  ACCTCTCAGACCCCAAGCAACT  (797)
F77    99  1  ACGCTAAAGCCAAAAACAACCAG  (1211)
F78   101  1  AAGATGATGCTCCAAGAACAATAGCTT  (857)
F79   101  1  AAACGACACCAGCTGGTTCATACT  (1012)
F80   102  1  CGACGGTTGTAATAAGAAGTTTGCTC  (1152)
F81   103  1  CAAGAACAATAGCTTGCCCTCATAAA  (869)
F82   103  1  GGAACAGAAGCAGGTGCAGATC  (606)
F83   106  1  AAAAGCCCTTTCAGTGCACATTC  (1040)
F84   106  2  TCTGCTATGAGAAAGCATCTGCAC  (922)
F85   106  1  AAACAGTGGTTGAAGAGCAGATCATT  (695)
F86   106  1  TTCGACGGTTGTAATAAGAAGTTTGC  (1150)
F87   106  1  AGCGTTCGTTGAGAGCTCAAAG  (987)
F88   106  1  GCCCCTTCGACGGTTGTAATA  (1145)
F89   107  1  CAACTGGCAGAATTTGCCAGA  (814)
F90   107  1  AGTTCTCGGTCACCATGTGGTC  (644)
F91   108  1  TGAGAAAGCATCTGCACACCC  (929)
F92   108  1  ATGAGAAAGCATCTGCACACCC  (928)
F93   108  1  TATGAGAAAGCATCTGCACACCC  (927)
F94   111  1  GAGTTCTCGGTCACCATGTGGT  (643)
F95   111  1  CACCACCACCAAGAGGTGATTC  (289)
F96   111  1  GACGACGACTACATCGAGCAGAC  (421)
F97   112  1  CCCGGTGGAGACTATCGAGAC  (93)
F98   112  1  CAGAAGCAGGTGCAGATCAAGAC  (610)
F99   112  1  GCTAAAGCCAAAAACAACCAGTGA  (1213)
F100  113  1  GACCTCTCAGACCCCAAGCAA  (796)
R1      3  1  GCAAACTTCTTATTACAACCGTCGAA  (1175)
R2      4  1  ACATAGGGCCTGTCTCCGGTAT  (1142)
R3      4  1  AGCAAACTTCTTATTACAACCGTCGA  (1176)
R4      6  1  AGCTTTGAGCTCTCAACGAACG  (1010)
R5      8  1  GAGCAAACTTCTTATTACAACCGTCG  (1177)
R6      8  1  CTTTGAGCTCTCAACGAACGCT  (1008)
R7      9  1  GGTTGTTTTTGGCTTTAGCGTGT  (1231)
R8     10  1  CTGGTTGTTTTTGGCTTTAGCGT  (1233)
R9     11  1  CCTGTCTCCGGTATGGATTCG  (1134)
R10    12  1  CTGTCTCCGGTATGGATTCGC  (1133)
R11    12  1  GTCTCCGGTATGGATTCGCAC  (1131)
R12    12  1  AGCGTCTGCTCGATGTAGTCGT  (446)
R13    13  1  TTCTGTTCCCACTTCTTATTACCCG  (614)
R14    15  2  TCTGCTCGATGTAGTCGTCGTCT  (442)
R15    15  1  ACTGGTTGTTTTTGGCTTTAGCG  (1234)
R16    16  4  GTCTGCTCGATGTAGTCGTCGTC  (443)
R17    16  4  ATCCTCGTCGTCTTCGTCGTC  (159)
R18    17  1  CAGTATGAACCAGCTGGTGTCGT  (1036)
R19    17  1  TTGAGCTCTCAACGAACGCTTT  (1006)
R20    19  1  AGACCACATGGTGACCGAGAAC  (666)
R21    19  1  CTTCTTATTACCCGGGTCGGC  (603)
R22    20  1  CTGCTCGATGTAGTCGTCGTCTC  (441)
R23    21  1  TCGATGTAGTCGTCGTCTCCG  (437)
R24    22  1  TTTGAGCTCTCAACGAACGCTT  (1007)
R25    22  1  CCACTTCTTATTACCCGGGTCG  (606)
R26    22  1  CACTTCTTATTACCCGGGTCGG  (605)
R27    22  1  GACCAGCGTCTGCTCGATGTA  (450)
R28    23  1  AATTGAAGTCCAGTGAAAAGCGC  (1099)
R29    23  1  TGAACCAGCTGGTGTCGTTTTAG  (1031)
R30    23  1  GACCACATGGTGACCGAGAACT  (665)
R31    25  1  AACTTCTTATTACAACCGTCGAAGGG  (1172)
R32    25  1  TGTTCCCACTTCTTATTACCCGG  (611)
R33    26  3  CCCAGGTAACTCTTCTTGCCG  (551)
R34    26  1  AGAGGTCAATGCCAGGTATCCC  (802)
R35    26  2  CCAGGTAACTCTTCTTGCCGC  (550)
R36    27  1  TTGAAGTCCAGTGAAAAGCGCT  (1097)
R37    29  1  TGAGGATCTGGTCCTCGAACC  (394)
R38    29  1  CACATGGTGACCGAGAACTCG  (662)
R39    29  1  GTATGAACCAGCTGGTGTCGTTTT  (1034)
R40    30  1  TCAATCTCATGCAGTTCCACGAT  (80)
R41    30  1  TCAATCTCATGCAGTTCCACGA  (80)
R42    30  1  AGTATGAACCAGCTGGTGTCGTTT  (1035)
R43    32  1  GGTCTCGATAGTCTCCACCGG  (114)
R44    33  1  AAGACCACATGGTGACCGAGAA  (667)
R45    33  1  CAATCTCATGCAGTTCCACGATC  (79)
R46    34  1  GGAATGAGGATCTGGTCCTCG  (398)
R47    34  2  TTCCCACTTCTTATTACCCGGGT  (609)
R48    34  2  TGAAGTCCAGTGAAAAGCGCTT  (1096)
R49    35  1  GCTCGATGTAGTCGTCGTCTCC  (439)
R50    36  1  GTATGAACCAGCTGGTGTCGTTTTA  (1034)
R51    36  1  TTCCCACTTCTTATTACCCGGG  (609)
R52    42  1  GAATGAGGATCTGGTCCTCGAAC  (397)
R53    43  1  GAGGTCAATGCCAGGTATCCCT  (801)
R54    44  1  GTGGTCTCGATAGTCTCCACCG  (116)
R55    44  1  AGGTAACTCTTCTTGCCGCTCTTC  (548)
R56    45  1  CACATTCTGCACAGACGTGGA  (982)
R57    46  1  AAAGGGCTTTTCTCCAGTATGAACC  (1050)
R58    47  2  ACCATCCTCGTCGTCTTCGTC  (162)
R59    48  1  GCTTCTGTTCCCACTTCTTATTACCC  (616)
R60    48  1  CACATTCTGCACAGACGTGGAC  (982)
R61    49  1  CAGGTAACTCTTCTTGCCGCTCT  (549)
R62    49  1  GATGCTTTCTCATAGCAGAGTTATCCC  (940)
R63    49  1  CTGAAGACCACATGGTGACCG  (670)
R64    50  2  CCTGCTTCTGTTCCCACTTCTTATTAC  (619)
R65    52  1  ACCAGCGTCTGCTCGATGTAGT  (449)
R66    53  1  TCTTATTACAACCGTCGAAGGGG  (1168)
R67    53  1  TTGTTTTTGGCTTTAGCGTGTGTT  (1229)
R68    54  1  ACTGAAAGGGCTTTTCTCCAGTATG  (1054)
R69    55  1  CACTGAAAGGGCTTTTCTCCAGTAT  (1055)
R70    57  1  GAGGTGAGTTCTCCCCAATGATC  (736)
R71    58  1  GGTACCGGAATGAGGATCTGGT  (404)
R72    62  1  GTCTCGATAGTCTCCACCGGG  (113)
R73    63  1  CTTCAACCACTGTTTCATGGTCAATA  (709)
R74    64  1  CCTTTATGAGGGCAAGCTATTGTTC  (896)
R75    64  1  CTTCAACCACTGTTTCATGGTCAATAT  (709)
R76    65  1  TTTTTGGCTTTAGCGTGTGTTAAGAT  (1226)
R77    66  1  TTGTTTTTGGCTTTAGCGTGTGTTA  (1229)
R78    69  1  CTTGGGGTCTGAGAGGTCAATG  (813)
R79    71  1  GTCCGTGGCAATGTAGAGGGT  (36)
R80    71  1  TCTGGCAAATTCTGCCAGTTG  (834)
R81    72  1  TCACTGGTTGTTTTTGGCTTTAGC  (1236)
R82    72  1  CTTTGTGCAGCCTTTATGAGGG  (906)
R83    73  1  GTTGTTTTTGGCTTTAGCGTGTGT  (1230)
R84    74  1  TGAAAGGGCTTTTCTCCAGTATGA  (1052)
R85    75  1  GCAAGCTATTGTTCTTGGAGCATC  (885)
R86    76  1  GCCTTTATGAGGGCAAGCTATTG  (897)
R87    76  1  GCTTGGGGTCTGAGAGGTCAAT  (814)
R88    76  1  CCAATGATCTGCTCTTCAACCAC  (722)
R89    77  1  CCACCGTGGTCTCGATAGTCTC  (121)
R90    78  1  CTGCTTCTGTTCCCACTTCTTATTACC  (618)
R91    80  1  TTGGCTTCATTCTGGCAAATTC  (844)
R92    81  1  CTTCAACCACTGTTTCATGGTCAAT  (709)
R93    81  1  AATCTCATGCAGTTCCACGATCTC  (78)
R94    82  1  CTTCAACCACTGTTTCATGGTCAA  (709)
R95    83  1  GGGCTTTTCTCCAGTATGAACCA  (1047)
R96    83  2  ACCACATGGTGACCGAGAACTC  (664)
R97    84  1  GTGCAGATGCTTTCTCATAGCAGA  (945)
R98    84  1  TGTGCAGATGCTTTCTCATAGCAG  (946)
R99    85  1  CATTCTGCACAGACGTGGACTC  (980)
R100   85  1  TCTGAGAGGTCAATGCCAGGTATC  (806)
Shortened zinc finger of BB.6S



1. Nucleic acid sequence of the "shortened" zinc finger for BB.6S 
(the nucleotides that are crossed out are absent, only the 
underlined nucleotides code for amino acids of the protein in the 
second band) 



1 ATGGCCTCGG GCGACACCCT CTACATTGCC ACGGACGGCT CGGAGATGCC 

51 AGCCGAGATC GTGGAACTGC ATGAGATTGA GGTGGAGACC ATCCCGGTGG 

101 AGACTATCGA GACCACGGTG GTGGGCGAGG AGGAGGACGA CGACGAAGAC 

151 GACGAGGATG GTGGCGGCGG AGACCACGGT GGCGGGGGCG GCCACGGGCA 

201 CGCTGGCCAC CACCATCACC ACCACCACCA CCACCACCCG CCCATGATCG 

251 CGCTGCAGCC GCTGGTCACC GACGACCCGA CCCAAGTGCA CCACCACCAA 

301 GAGGTGATTC TGGTGCAGAC GCGCGAGGAG GTAGTGGGTG GCGACGACTC 

351 GGACGGGCTG CGCGCCGAGG ACGGGTTCGA GGACCAGATC CTCATTCCGG 

401 TACCCGCGCC GGCCGGCGGA GACGACGACT ACATCGAGCA GACGCTGGTC

451 ACCGTGGCGG CGGCCGGCAA GAGCGGTGGC GGGTCTTCGT CGGGCGGCGG

501 CCGCGTTAAG AAGGGCGGCG GCAAGAAGAG cGGCAAGAAG AGTTACCTGG 

551 GCAGCGGGGC CGGCGCGGCG GGCGGTGGCG GCGCCGACCC GGGTAATAAG 

601 AAGTGGGAAC AGAAGCAGGT GCAGATCAAG ACCCTGGAGG GCGAGTTCTC 

651 GGTCACCATG TGGTCTTCAG ATGAAAAAAA AGATATTGAC CATGAAACAG

701 TGGTTGAAGA GCAGATCATT GGGGAGAACT CACCTCCTGA TTATTCTGAA 

751 TATATGACAG GCAAGAAACT CCCTCCTGGA GGGATACCTG GCATTGACCT

801 CTCAGACCCC AAGCAACTGG CAGAATTTGC CAGAATGAAG CCAAGAAAAA
851 TTAAAGAAGA TGATGCTCCA AGAACAATAG CTTGCCCTCA TAAAGGCTGC
901 ACAAAGAtGT TCAGGGATAA CTCTGCTATG AgAAACCATC TGCACACCCA 
951 CCCTCCCAGA GTCCACGTCT CTGCACAATG TCGCAAAGCG TTCGTTGAGA 

1001 GCTCAAAGCT AAAACGACAC CAGCTGGTTC ATACTGGAGA AAAGCCCTTT 

1051 CAGTGCACAT TCGAAGGCTG CGGGAAGCGC TTTTCACTGG ACTTCAATTT 

1101 GCGCACGCAT GTGCGAATCC ATACCGGAGA CAGGCCCTAT GTGTGCCCCT 

1151 TCGACGGTTG TAATAAGAAG TTTGCTCAGT CAACTAACCT GAAATCTCAC 

1201 ATCTTAACAC ACGCTAAAGC CAAAAACAAC CAGTGA 
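The positions listed in the primer tables can be checked against this coding sequence by searching for each primer, or for its reverse complement in the case of a reverse primer. The following is a minimal sketch; the helper names `reverse_complement` and `locate` are illustrative and not taken from the source, and `CDS_1_100` holds only the first 100 nucleotides of the listing above.

```python
# Complement map for unambiguous DNA bases only.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string (A, C, G, T only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def locate(primer, template):
    """Find a primer on the template.

    Returns (start, end, strand) with 1-based inclusive coordinates on the
    template, where strand is '+' for a forward primer and '-' for a primer
    that matches as its reverse complement. Returns None if there is no
    exact match. Spaces in either argument are ignored.
    """
    primer = primer.upper().replace(" ", "")
    template = template.upper().replace(" ", "")
    for strand, probe in (("+", primer), ("-", reverse_complement(primer))):
        idx = template.find(probe)
        if idx != -1:
            return idx + 1, idx + len(probe), strand
    return None


# Nucleotides 1-100 of the coding sequence, copied from the listing.
CDS_1_100 = (
    "ATGGCCTCGG GCGACACCCT CTACATTGCC ACGGACGGCT CGGAGATGCC"
    "AGCCGAGATC GTGGAACTGC ATGAGATTGA GGTGGAGACC ATCCCGGTGG"
)

print(locate("GAG ATC GTG GAA CTG CAT GAG", CDS_1_100))  # K801-F
print(locate("TCAATCTCATGCAGTTCCACGAT", CDS_1_100))      # R40
```

For K801-F the match spans positions 55 to 75, which agrees with the primer table. For R40 the match spans 58 to 80 on the minus strand, so the value in parentheses in the R list (80) appears to be the coding-strand coordinate of the primer's 5' end.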



2. Protein sequence of the "shortened" zinc finger for BB.6S (the
amino acids that are crossed out are absent, only the underlined
amino acids are present in the second band)

CPHK g ctkmfrdnsamrkhlhthgprvhv 

caecgkafv es sklkrhqlvhtgekpfq 

CTFE G C G KRFSLDFNLRTH VRIHTGDRP Y V 
CPFDGCNKKFAQSTNLKSHILTH 

n=57 



Figure 13 



Antisense oligonucleotides used in accordance with the present invention 



(The positions given relate to the coding region and not to the position
numbering of the sequence listing.)

>gi|1835104|emb|Z85393.1|HSZ85393 H. sapiens Ig lambda light chain variable
region gene (34-30SWIIE197) rearranged; Ig-Light-Lambda; VLambda
Identities = 14/14 (100%)

Query: 74    accggagcccgctg 88
             ||||||||||||||
Sbjct: 241   accggagcccgctg 228
V_region 1..348

>gi|14589948|ref|NM_000937.2| Homo sapiens polymerase (RNA) II (DNA directed)
polypeptide A, 220kDa (POLR2A), mRNA
Identities = 17/18 (94%)

Query: 77    ggagcccgctgtgggaga 94
             |||||||| |||||||||
Sbjct: 5099  ggagcccggtgtgggaga 5082
CDS 387..6299

>gi|18596386|ref|XM_093190.1| Homo sapiens similar to T-cell activation
protein (LOC170226), mRNA
Identities = 20/21 (95%)

Query: 76    cggagcccgctgtgggagatg 96
             |||||| ||||||||||||||
Sbjct: 156   cggagcacgctgtgggagatg 176
CDS 1..1314

>gi|7934571|gb|AF220152.2|AF220152 Homo sapiens TACC2 mRNA, complete cds
Identities = 14/14 (100%)

Query: 78    gagcccgctgtggg 91
             ||||||||||||||
Sbjct: 977   gagcccgctgtggg 990
CDS 293..3013

>gi|6165844|gb|AF100772.1|AF100772 Homo sapiens tenascin-M1 (TNM1) mRNA,
Identities = 16/16 (100%)

Query: 79    agcccgctgtgggaga 94
             ||||||||||||||||
Sbjct: 4377  agcccgctgtgggaga 4362

CDS 65..8242
/function="putative receptor molecule"

>gi|24496766|ref|NM_004712.3| Homo sapiens hepatocyte growth factor-regulated
tyrosine kinase substrate (HGS), mRNA
Identities = 15/15 (100%)

Query: 79    agcccgctgtgggag 93
             |||||||||||||||
Sbjct: 2088  agcccgctgtgggag 2074
CDS 78..2411
/note="human growth factor-regulated tyrosine kinase

>gi|4324953|gb|AF114821.1|HSSM03 Homo sapiens smoothened (SMO) gene, exons 3
through 12 and complete
Identities = 14/14 (100%)

Query: 79    agcccgctgtggga 93
             ||||||||||||||
Sbjct: 6838  agcccgctgtggga 6825
CDS 6833..6967

>gi|22064913|ref|XM_090047.5| Homo sapiens LOC160156 (LOC160156), mRNA
Identities = 14/14 (100%)

Query: 80    gcccgctgtgggag 93
             ||||||||||||||
Sbjct: 1427  gcccgctgtgggag 1440
CDS 709..1980

>gi|517388|emb|Z31606.1|HSB2N03 H. sapiens brain-2/N-Oct 3 gene (promoter
region)
Identities = 15/15 (100%)

Query: 80    gcccgctgtgggaga 94
             |||||||||||||||
Sbjct: 547   gcccgctgtgggaga 561
promoter 1..670

>gi|23273501|gb|BC035782.1| Homo sapiens, Similar to tyrosine kinase,
non-receptor, 1, clone
Identities = 15/15 (100%)

Query: 83    cgctgtgggagatgt 97
             |||||||||||||||
Sbjct: 1032  cgctgtgggagatgt 1046
CDS 98..2083

>gi|7662017|ref|NM_015153.1| Homo sapiens PHD finger protein 3 (PHF3), mRNA
Identities = 14/14 (100%)

Query: 84    gctgtgggagatgt 97
             ||||||||||||||
Sbjct: 2228  gctgtgggagatgt 2241
CDS 28..6147

>gi|20561197|ref|XM_062302.3| Homo sapiens similar to RING finger protein 18
(Testis-specific ring-finger protein) (LOC120826), mRNA
Score = 28.2 bits (14), Expect = 119
Identities = 14/14 (100%)

Query: 84    gctgtgggagatgt 97
             ||||||||||||||
Sbjct: 489   gctgtgggagatgt 502
CDS 190..2244
/product="similar to RING finger protein 18 (Testis-specific ring-finger protein)"

>gi|4885330|ref|NM_005305.1| Homo sapiens G protein-coupled receptor 42
(GPR42), mRNA
Identities = 14/14 (100%)

Query: 84    gctgtgggagatgt 97
             ||||||||||||||
Sbjct: 486   gctgtgggagatgt 473
CDS 1..1041

>gi|17473297|ref|XM_061928.1| Homo sapiens LOC120226 (LOC120226), mRNA
Identities = 14/14 (100%)

Query: 84    gctgtgggagatgt 97
             ||||||||||||||
Sbjct: 453   gctgtgggagatgt 466
CDS 1..492

>gi|21359977|ref|NM_024947.2| Homo sapiens polyhomeotic like 3 (Drosophila)
(PHC3), mRNA
Identities = 14/14 (100%)

Query: 85    ctgtgggagatgta 97
             ||||||||||||||
Sbjct: 404   ctgtgggagatgta 391
CDS 65..2959
/note="early development regulator 3; polyhomeotic 3"

>gi|19718811|gb|BC007249.2| Homo sapiens, pelota homolog (Drosophila), clone
Identities = 13/13 (100%)

Query: 97    taacggtgcctgc 109
             |||||||||||||
Sbjct: 1303  taacggtgcctgc 1291
CDS 274..1431
/product="pelota homolog (Drosophila)"

>gi|19924298|ref|NM_004958.2| Homo sapiens FK506 binding protein 12-rapamycin
associated protein 1 (FRAP1), mRNA
Identities = 13/13 (100%)

Query: 98    acggtgcctgccg 111
             |||||||||||||
Sbjct: 5720  acggtgcctgccg 5732
CDS 80..7729
/note="FK506 binding protein 12-rapamycin associated protein 2; rapamycin target
protein; FKBP12-rapamycin complex-associated protein 1; FKBP-rapamycin
associated

>gi|22049727|ref|XM_040948.8| Homo sapiens dynein, cytoplasmic, heavy
polypeptide 1 (DNCH1), mRNA
Identities = 13/13 (100%)

Query: 99    acggtgcctgccg 111
             |||||||||||||
Sbjct: 6291  acggtgcctgccg 6303
CDS 6..6830
/product="similar to cytoplasmic dynein heavy chain"

>gi|459833|gb|L25085.1|HUMSEC61B Human Sec61-complex beta-subunit mRNA,
Identities = 14/14 (100%)

Query: 100   cggtgcctgccgag 113
             ||||||||||||||
Sbjct: 220   cggtgcctgccgag 207
CDS 64..354
/function="protein translocation across the er-membrane"

>gi|21704280|ref|NM_020433.2| Homo sapiens junctophilin 2 (JPH2), mRNA
Identities = 17/18 (94%)

Query: 100   cggtgcctgccgagcctc 117
             |||||||||| |||||||
Sbjct: 2526  cggtgcctgcagagcctc 2509
CDS 874..2964

mediate cross talk between cell surface and intracellular ion channels. 
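
The "Identities" figure reported for each hit is the number of matching positions over the aligned length. The following is a minimal sketch that recomputes it, together with the match line, for an ungapped alignment such as the junctophilin 2 hit above; the function names are illustrative and are not part of BLAST.

```python
def match_line(query, subject):
    """Return the '|' / ' ' marker line for two aligned, equal-length strings."""
    if len(query) != len(subject):
        raise ValueError("aligned sequences must have the same length")
    return "".join("|" if q == s else " " for q, s in zip(query, subject))


def identities(query, subject):
    """Return (matches, aligned_length, percent) for an ungapped alignment."""
    line = match_line(query.lower(), subject.lower())
    matches = line.count("|")
    return matches, len(line), 100.0 * matches / len(line)


# Query and Sbjct rows of the junctophilin 2 (JPH2) hit.
query = "cggtgcctgccgagcctc"
sbjct = "cggtgcctgcagagcctc"

matches, length, percent = identities(query, sbjct)
print(match_line(query, sbjct))
print(f"Identities = {matches}/{length} ({percent:.0f}%)")
```

With these two rows the sketch gives 17 matches over 18 positions, in line with the 17/18 (94%) reported for that hit.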

>gi|12804622|gb|BC001734.1|BC001734 Homo sapiens, protein translocation 
complex beta, clone MGC:1255 
Identities = 14/14 (100%)

Query: 100   cggtgcctgccgag 113
             ||||||||||||||
Sbjct: 233   cggtgcctgccgag 220
CDS 77..367

>gi|22053428|ref|XM_038146.5| Homo sapiens interferon, gamma-inducible protein
30 (IFI30), mRNA
Identities = 16/17 (94%)

Query: 100   cggtgcctgccgagcct 116
             ||||| |||||||||||
Sbjct: 289   cggtggctgccgagcct 305
CDS 74..826
/product="similar to Gamma-interferon inducible lysosomal thiol reductase
precursor (Gamma-interferon-inducible protein IP-30)"

>gi|7657145|ref|NM_014365.1| Homo sapiens protein kinase H11 (H11), mRNA
Identities = 13/13 (100%)

Query: 101   ggtgcctgccgag 113
             |||||||||||||
Sbjct: 763   ggtgcctgccgag 775
CDS 524..1114
/note="contains hsp20/crystallin family domain; estradiol-induced; small stress
protein-like protein HSP22"

>gi|21264316|ref|NM_014599.3| Homo sapiens melanoma antigen, family D, 2
(MAGED2), mRNA
Identities = 17/18 (94%)

Query: 101   ggtgcctgccgagcctct 118
             ||||||| ||||||||||
Sbjct: 332   ggtgcctcccgagcctct 315
CDS 97..1917
/note="hepatocellular carcinoma associated protein; breast cancer associated
gene 1"

>gi|11967745|emb|AJ293618.1|HSA293618 Homo sapiens mRNA for hypothetical
protein 11B6, clone XP11B6
Identities = 17/18 (94%)

Query: 101   ggtgcctgccgagcctct 118
             ||||||| ||||||||||
Sbjct: 248   ggtgcctcccgagcctct 231
CDS 13..1833

>gi|20514781|ref|NM_139015.1| Homo sapiens SPPL3 (SPPL3), mRNA
Identities = 14/14 (100%)

Query: 104   gcctgccgagcctc 117
             ||||||||||||||
Sbjct: 569   gcctgccgagcctc 582
CDS 1..1905
/product="hypothetical protein XP_068909"

>gi|23094385|emb|AJ345030.1|HSA345030 Homo sapiens mRNA for presenilin-like
protein 4 (PSL4 gene)
Identities = 14/14 (100%)

Query: 104   gcctgccgagcctc 117
             ||||||||||||||
Sbjct: 598   gcctgccgagcctc 611

CDS 54..1208
/function="putative intramembrane protease"

>gi|4502840|ref|NM_003654.1| Homo sapiens carbohydrate (keratan sulfate Gal-6)
sulfotransferase 1 (CHST1), mRNA
Identities = 16/17 (94%)

Query: 105   cctgccgagcctctacg 121
             ||||| |||||||||||
Sbjct: 735   cctgcggagcctctacg 751
CDS 367..1602
/note="carbohydrate (chondroitin 6/keratan) sulfotransferase 1"
/protein_id="NP_003645.1"

>gi|17485022|ref|XM_066361.1| Homo sapiens similar to glutathione S-transferase
theta 1 (LOC129041), mRNA
Identities = 13/13 (100%)

Query: 105   cctgccgagcctc 117
             |||||||||||||
Sbjct: 580   cctgccgagcctc 568
CDS 1..633
/product="similar to glutathione S-transferase theta 1"

>gi|22134527|gb|AF331523.1| Homo sapiens chromosome 12 putative anion
transporter mRNA, partial
Identities = 13/13 (100%)

Query: 106   ctgccgagcctct 118
             |||||||||||||
Sbjct: 807   ctgccgagcctct 819
CDS <312..2003
/note="member of the SLC26 family;

>gi|14784297|ref|XM_031102.1| Homo sapiens Breakpoint cluster region protein,
uterine leiomyoma, 2 (BCRP2), mRNA
Identities = 13/13 (100%)

Query: 108   gccgagcctctac 120
             |||||||||||||
Sbjct: 1784  gccgagcctctac 1796
CDS 55..2883
/product="similar to KIAA1824 protein"

>gi|20535791|ref|XM_119632.1| Homo sapiens LOC205318 (LOC205318), mRNA
Identities = 14/14 (100%)

Query: 123   tcggctctagcacc 136
             ||||||||||||||
Sbjct: 491   tcggctctagcacc 504
CDS 1..522
/product="hypothetical protein XP_119632"

>gi|4557448|ref|NM_001271.1| Homo sapiens chromodomain helicase DNA binding
protein 2 (CHD2), mRNA
Identities = 14/14 (100%)

Query: 125   ggctctagcacctt 138
             ||||||||||||||
Sbjct: 2791  ggctctagcacctt 2778
CDS 708..5927
/product="chromodomain helicase DNA binding protein 2"

>gi|21264574|ref|NM_139135.1| Homo sapiens SWI/SNF related, matrix associated,
actin dependent regulator of chromatin, subfamily f, member 1 (SMARCF1),
transcript variant 2, mRNA
Identities = 14/14 (100%)

Query: 128   tctagcaccttgac 141
             ||||||||||||||
Sbjct: 5474  tctagcaccttgac 5487
CDS 371..6577
/note="brain protein 120; chromosome 1 open reading frame 4; SWI/SNF complex
protein p270; BRG1-associated factor 250a; chromatin remodeling factor p250;
OSA1 nuclear protein"

>gi|18129612|gb|AF333072.2|AF333072 Homo sapiens HERV-K18.1 5' long terminal
repeat, complete sequence; gag protein (gag) gene, gag-K18.1 allele, complete
cds; pol protein (pol) gene, pol-K18.1 allele, complete cds; env protein (env)
gene, env-K18.1 allele, complete cds; and 3' long terminal repeat, complete
sequence
Identities = 17/18 (94%)

Query: 143   tactctaactccacctct 160
             |||||||||||| |||||
Sbjct: 1196  tactctaactccccctct 1179
CDS 1113..1874

>gi|4503778|ref|NM_002029.1| Homo sapiens formyl peptide receptor 1 (FPR1),
Identities = 14/14 (100%)

Query: 148   taactccacctctg 161
             ||||||||||||||
Sbjct: 1102  taactccacctctg 1089
CDS 62..1114
/product="formyl peptide receptor 1"
/product="protein tyrosine kinase-7"

>gi|21655145|gb|AY082886.1| Homo sapiens eukaryotic translation initiation
factor 4GI (EIF4GI)
Identities = 14/14 (100%)

Query: 150   actccacctctggt 163
             ||||||||||||||
Sbjct: 1400  actccacctctggt 1387
CDS 275..5077

>gi|13357213|ref|NM_015545.1| Homo sapiens KIAA0632 protein (KIAA0632), mRNA
Identities = 17/18 (94%)

Query: 152   tccacctctggtagggcc 169
             |||||||||||| |||||
Sbjct: 931   tccacctctggttgggcc 914
CDS 282..1790
/product="KIAA0632 protein"

>gi|18182862|gb|BC015632.1| Homo sapiens, similar to hypothetical protein
XP_166541, clone
Identities = 18/19 (94%)

Query: 153   ccacctctggtagggccac 171
             ||||||| |||||||||||
Sbjct: 2098  ccacctcgggtagggccac 2080
CDS 52.. 2517 



>gi | 24432033 | ref I NM_004 959 . 3 I Homo sapiens nuclear receptor subfamily 5, group 
A, member 1 (NR5A1), 
Identities = 17/18 (94%) 



Query: 159  ctggtagggccacctctg 176
            |||| |||||||||||||
Sbjct: 2316 ctgggagggccacctctg 2299
CDS 52.. 13776 



>gi|17432414|gb|AF447167.1|F447157S05 Homo sapiens protein tyrosine kinase-7
(PTK7) gene, exons 11, 12, and 13 
Identities = 15/15 (100%) 



Query: 161 ggtagggccacctct 175 
I I I I I I I I I I I I I I I 

Sbjct: 553 ggtagggccacctct 567 
CDS AF447164.1: 548.. 683 



>gi|19882212|ref|NM_032119.1| Homo sapiens very large G protein-coupled
receptor 1 (VLGR1), mRNA 
Identities = 17/18 (94%) 



Query: 170 acctctgatagctctggt 187 
I I I I I I I I I I I I I I I I I 

Sbjct: 11300 acctctgataactctggt 11283 
CDS 97..19020

>gi|22044294|ref|XM_174449.1| Homo sapiens LOC255281 (LOC255281), mRNA
Identities = 16/16 (100%) 



Query: 180 cctctgatagctctgg 196 

I I I I I I 1 I I I I I I I I I 
Sbjct: 460 cctctgatagctctgg 445 
CDS 1 . . 474 



>gi|1841544|gb|U89337.1|HSMHC3W36A Homo sapiens HLA class III region
containing NG7, cAMP response element binding protein-related protein (CREB-RP),
and tenascin X genes,
Identities = 17/18 (94%) 



Query: 180   gctctggtgccaccaccc 197
             ||||||||||| ||||||
Sbjct: 54778 gctctggtgcctccaccc 54795
CDS 54566..54883



>gi|6164703|gb|AF167572.1|AF167572 Homo sapiens protein methyltransferase
(JBP1) mRNA, complete cds
Identities = 14/14 (100%) 
Query: 181 ctctggtgccacca 194
I I I I I I I I I I I I I I 

Sbjct: 574 ctctggtgccacca 561 

CDS 92..2005

/function="methylates histones H2A and H4 and myelin basic
/product="protein methyltransferase"



>gi|2323409|gb|AF015913.1|AF015913 Homo sapiens SKB1Hs mRNA, complete cds
Identities = 14/14 (100%) 



Query: 181 ctctggtgccacca 194 

I I I I I I I I I I I I I I 
Sbjct: 483 ctctggtgccacca 470 

CDS 1..1914 
/note="homolog of fission yeast Skb1"

>gi|18490998|ref|NM_003882.2| Homo sapiens WNT1 inducible signaling pathway
protein 1 (WISP1), transcript variant 1, mRNA
Identities = 18/19 (94%) 



Query: 181 ctctggtgccaccacccgc 199
           |||||||||| ||||||||
Sbjct: 569 ctctggtgcccccacccgc 587
CDS 77..1180



>gi|14245731|dbj|AB051853.1| Homo sapiens ARHGAP9 gene for rho-GTPase
activating protein, complete
Identities = 15/15 (100%) 



Query: 181 ctctggtgccaccac 195 

I I I I I I I I I I I I I I I 
Sbjct: 2013 ctctggtgccaccac 2027 

CDS 140..2335

/function="regulating adhesion of hematopoietic cells to
extracellular matrix"



>gi|16876446|ref|NM_054028.1| Homo sapiens acyl-malonyl condensing enzyme
(AMAC), mRNA
Identities = 14/14 (100%)

Query: 181 ctctggtgccacca 194 
I I I I I I I I I I I I I I 

Sbjct: 220 ctctggtgccacca 233 
CDS 122.. 1138 



>gi|18594399|ref|XM_092954.1| Homo sapiens similar to acidic protein rich in
leucines (LOC164697), mRNA
Identities = 14/14 (100%) 



Query: 182 tctggtgccaccac 195 

I I I I I I I I I I I I I I 
Sbjct: 721 tctggtgccaccac 734 
CDS 1..2454

/product="similar to acidic protein rich in leucines" 

>gi|20553841|ref|XM_031689.6| Homo sapiens MAX dimerization protein 5 (MGA)
mRNA 

Identities = 14/14 (100%) 



Query: 182 tctggtgccaccac 195 

I I I I I I I I I I I I I I 
Sbjct: 463 tctggtgccaccac 476 

CDS 82..4773

/product="similar to MAX-interacting protein" 

>gi|18575418|ref|XM_100074.1| Homo sapiens LOC159480 (LOC159480), mRNA
Identities = 14/14 (100%) 

Query: 183 ctggtgccaccacc 196 

I I I I I I I I I I I I I I 
Sbjct: 475 ctggtgccaccacc 488 

CDS 1..1425 
/product="hypothetical protein XP_100074" 

>gi|6912511|ref|NM_012330.1| Homo sapiens monocytic leukemia zinc finger
protein-related factor (MORF), mRNA
Identities = 14/14 (100%) 

Query: 183 ctggtgccaccacc 196 

I I I I I I I I I I I I I I 
Sbjct: 1535 ctggtgccaccacc 1548 

CDS 316..6537

/note="alternatively spliced; histone acetyltransferase"
/product="monocytic leukemia zinc finger protein-related factor" 

>gi|18032211|gb|AF217500.1|AF217500 Homo sapiens histone acetyltransferase
MOZ2 (MOZ2) mRNA, complete cds 
Identities = 14/14 (100%) 

Query: 183 ctggtgccaccacc 196 

I I I I I I I I I I I I I I 
Sbjct: 1708 ctggtgccaccacc 1721 

CDS 489..6707

/note="MYST family member; similar to MOZ"
/product="histone acetyltransferase MOZ2"

>gi|24497588|ref|NM_139058.1| Homo sapiens aristaless related homeobox (ARX)
mRNA 

Identities = 15/15 (100%) 

Query: 185 ggtgccaccacccgc 199 

I I I I I I I I I I I I I I I 
Sbjct: 663 ggtgccaccacccgc 649 
CDS 1..1689

>gi|4758787|ref|NM_004551.1| Homo sapiens NADH dehydrogenase (ubiquinone) Fe-S
protein 3, 30kDa (NADH-coenzyme Q reductase) (NDUFS3), mRNA
Identities = 14/14 (100%) 

Query: 187 tgccaccacccgct 200 

I I I I I I I II I I I I I 
Sbjct: 675 tgccaccacccgct 662 
CDS 13.. 807 

>product="bA171A24.1 (RAR-related orphan receptor B)" gene="RORB"/
Identities = 27/29 (93%) 

Query: 189    ccaccacccgctcctcctcctgctgctgc 217
              ||||||||  |||||||||||||||||||
Sbjct: 140523 ccaccaccaactcctcctcctgctgctgc 140551

CDS 140352..>140970



>gi|23510326|ref|NM_015692.1| Homo sapiens alpha-2 macroglobulin family
protein VIP (VIP), mRNA
Identities = 21/22 (95%) 

Query: 195 cccgctcctcctcctgctgctg 216
           ||||||||| ||||||||||||
Sbjct: 47  cccgctcctgctcctgctgctg 68

CDS 18..5675

/note="contains Kazal-type serine protease inhibitor domain" 

>gi|21536391|ref|NM_007037.2| Homo sapiens a disintegrin-like and
metalloprotease (reprolysin type) with thrombospondin type 1 motif, 8 (ADAMTS8),
Identities = 29/32 (90%) 

Query: 196 ccgctcctcctcctgctgctgcttctgctgct 227
           |||||||| || ||||||||||| ||||||||
Sbjct: 741 ccgctcctgctgctgctgctgctgctgctgct 772
CDS 708..3380

>gi|7382453|ref|NM_005199.3| Homo sapiens cholinergic receptor, nicotinic,
gamma polypeptide (CHRNG), mRNA
Identities = 22/23 (95%) 

Query: 196 ccgctcctcctcctgctgctgct 218
           ||||| |||||||||||||||||
Sbjct: 19  ccgctgctcctcctgctgctgct 41
CDS 1..1563

>gi|17738306|ref|NM_006650.2| Homo sapiens complexin 2 (CPLX2), mRNA
Identities = 18/18 (100%) 

Query: 197 cgctcctcctcctgctgc 214
           ||||||||||||||||||
Sbjct: 488 cgctcctcctcctgctgc 471
CDS 346..750

>gi|21362089|ref|NM_032667.2| Homo sapiens Bernardinelli-Seip congenital
lipodystrophy 2 (seipin) (BSCL2), mRNA
Identities = 24/26 (92%) 

Query: 198  gctcctcctcctgctgctgcttctgc 224
            |||||| |||||||| ||||||||||
Sbjct: 1797 gctcctgctcctgcttctgcttctgc 1822
CDS 507..1901

>gi|21703365|ref|NM_022766.3| Homo sapiens ceramide kinase (CERK), mRNA
Identities = 21/22 (95%) 

Query: 198  gctcctcctcctgctgctgctt 219
            ||||||||||| ||||||||||
Sbjct: 1216 gctcctcctccagctgctgctt 1195

Identities = 14/14 (100%) 

Query: 204 cctcctgctgctgc 217 

I I II M M I II I I I 
Sbjct: 1586 cctcctgctgctgc 1599 

CDS 124..1737

/note="lipid kinase LK4"

>gi|10198655|ref|NM_020547.1| Homo sapiens anti-Mullerian hormone receptor,
type II (AMHR2), mRNA
Identities = 20/20 (100%)

Query: 199 ctcctcctcctgctgctgct 218 
I I I I I I I I I I I I I I I I I I I I 

Sbjct: 544 ctcctcctcctgctgctgct 563 
CDS 79.. 1800 

>gi|7662013|ref|NM_014745.1| Homo sapiens KIAA0233 gene product (KIAA0233),
mRNA

Identities = 30/32 (93%) 



Query: 199 ctcctcctcctgctgctgcttctgctgctcct 230
           |||||||||||||||||||| ||| |||||||
Sbjct: 803 ctcctcctcctgctgctgctgctgatgctcct 772
CDS 3..6110



>gi|4504382|ref|NM_001528.1| Homo sapiens HGF activator (HGFAC), mRNA
Identities = 27/28 (96%) 

Query: 199 ctcctcctcctgctgctgcttctgctgc 226
           |||||||||||||||||||| |||||||
Sbjct: 64  ctcctcctcctgctgctgctgctgctgc 91
CDS 4..1971

>gi|23111046|ref|NM_152227.1| Homo sapiens sorting nexin 5 (SNX5), transcript
variant 1, mRNA 
Identities = 18/18 (100%) 



Query: 200 tcctcctcctgctgctgc 217 
I I I II I I I I I I I I I I I I I 

Sbjct: 221 tcctcctcctgctgctgc 204 
CDS 181.. 1395 



>gi|22507392|ref|NM_022574.2| Homo sapiens PERQ amino acid rich, with GYF
domain 1 (PERQ1), mRNA
Identities = 28/31 (90%)



Query: 200  tcctcctcctgctgctgcttctgctgctcct 230
            |||||||||||| || ||||||||| |||||
Sbjct: 1856 tcctcctcctgccgccgcttctgctcctcct 1826
CDS 236..2689



>gi|10834965|ref|NM_000404.1| Homo sapiens galactosidase, beta 1 (GLB1),
transcript variant 179423, mRNA 
Identities = 26/28 (92%) 

Query: 200 tcctcctcctgctgctgcttctgctgct 227 

I I II I I I I I I I I I I I I I I I I I I I II I 
Sbjct: 83 tcctccttctgctgctggttctgctgct 110 
sig_peptide 61..129

CDS 61..2094

>gi|1814019|gb|U84408.1|HSU84408 Human IL-1 receptor related protein MyD88 
mRNA, complete cds 
Identities = 20/20 (100%) 
Query: 201 cctcctcctgctgctgcttc 220
I I I I I I I I I I I I I I I I I I I I 

Sbjct: 433 cctcctcctgctgctgcttc 414 
CDS 61.. 951 

>gi|15929589|gb|BC015219.1|BC015219 Homo sapiens, HBV associated factor, clone
Identities = 25/27 (92%) 

Query: 201 cctcctcctgctgctgcttctgctgct 227 
I I I I I I I I I I I I I I I I I I I I I I I I I 

Sbjct: 1211 cctcctgctgctgctgcttccgctgct 1185 
CDS 434.. 1966 

>gi|4885332|ref|NM_005306.1| Homo sapiens G protein-coupled receptor 43
(GPR43), mRNA
Identities = 22/23 (95%)

Query: 201 cctcctcctgctgctgcttctgc 223 
I I I I I I I I I I I I I I I I I I I I I I 

Sbjct: 165 cctcctcctgctgctgctgctgc 187 
CDS 1..993

/note="7tm_1; Region: 7 transmembrane receptor (rhodopsin
family)"

>gi|19111149|ref|NM_133265.1| Homo sapiens angiomotin (AMOT), mRNA
Identities = 24/26 (92%) 

Query: 202 ctcctcctgctgctgcttctgctgct 227 
I I I I I I I I I I I I I I I I I I I I I I I I 

Sbjct: 2448 ctccttctgctgctgctgctgctgct 2473 
CDS 797.. 2824 

>gi|24308357|ref|NM_033253.1| Homo sapiens 5'-nucleotidase, cytosolic IB
(NT5C1B), mRNA
Identities = 24/26 (92%) 

Query: 203 tcctcctgctgctgcttctgctgctc 228
           |||||| ||||||||| |||||||||
Sbjct: 678 tcctcccgctgctgctgctgctgctc 653

CDS 482..1690

/note="5' nucleotidase; autoimmune infertility-related protein; 5'-nucleotidase,
cytosolic IB; cytosolic 5'-nucleotidase IB; 5'-nucleotidase, cytosolic IB"

>gi|11545760|ref|NM_022055.1| Homo sapiens potassium channel, subfamily K,
member 12 (KCNK12),
Identities = 22/23 (95%) 

Query: 204 cctcctgctgctgcttctgctgc 226 

i I I I I I I I I I I I I I I I I I I I I I 
Sbjct: 59 cctcctgctgctgctgctgctgc 81 

CDS 1..1293

/note="tandem pore domain potassium channel THIK-2" 

>gi|5729946|ref|NM_006681.1| Homo sapiens neuromedin U (NMU), mRNA
Identities = 23/24 (95%)

Query: 205 ctcctgctgctgcttctgctgctc 228 
I I I I I I II I I I I I I I I I I I I I I I 

Sbjct: 169 ctcctgctgctgctgctgctgctc 192 

CDS 106..630

sig_peptide 106..207

>gi|4128016|emb|AJ011701.1|HSA011701 Homo sapiens TRHR gene promoter and exons
1-2, partial

Identities = 20/20 (100%) 

Query: 205 ctcctgctgctgcttctgct 224 
I I I I I I I I I I I I I I I I I I I I 

Sbjct: 1732 ctcctgctgctgcttctgct 1751 
exon 1691..1946

>gi|17985370|gb|AF284094.1|AF284094 Homo sapiens 13 binding protein (BRI3BP)
mRNA, complete cds 
Identities = 22/23 (95%) 

Query: 205 ctcctgctgctgcttctgctgct 227 
I I I I I I I I I I I I I I I I I I I I I I 

Sbjct: 130 ctcctgctgctgctgctgctgct 152 
CDS 88..843

>gi|15625294|gb|AF286190.1|AF286190 Homo sapiens VPS10 domain protein mRNA,
Identities = 22/23 (95%) 

Query: 205 ctcctgctgctgcttctgctgct 227 
I I I I I I I I I I I I I I I I I I I I I I 

Sbjct: 5 ctcctgctgctgctgctgctgct 27 
CDS <1..>1251 

>gi|28761|emb|X13629.1|HSAPOA4 Human intestinal mRNA for apolipoprotein A-IV
Identities = 22/23 (95%) 

Query: 205  ctcctgctgctgcttctgctgct 228
            |||||||||||||| ||||||||
Sbjct: 1203 ctcctgctgctgctcctgctgct 1181
CDS 46..1236

>gi|17458351|ref|XM_047011.2| Homo sapiens protein O-fucosyltransferase 1
(POFUT1), mRNA
Identities = 23/24 (95%) 

Query: 206 tcctgctgctgcttctgctgctcc 229
           |||||||||||||||||| |||||
Sbjct: 90  tcctgctgctgcttctgccgctcc 113
CDS 50..1216

>gi|15812213|ref|NM_033431.1| Homo sapiens phosphodiesterase 5A, cGMP-specific
(PDE5A), transcript variant 4, mRNA
Identities = 20/20 (100%) 

Query: 208 ctgctgctgcttctgctgct 227 
I I I I I I I I I I I I I I I I I I I I 

Sbjct: 227 ctgctgctgcttctgctgct 208 
CDS 156..2753

>gi|3252778|dbj|D89094.1| Homo sapiens mRNA for 3',5'-cyclic GMP
phosphodiesterase, complete
Identities = 20/20 (100%) 

Query: 208 ctgctgctgcttctgctgct 227 
I I I I I I I I I I I I I I I I I I I I 

Sbjct: 391 ctgctgctgcttctgctgct 372 
CDS 320..2947

>gi|11496985|ref|NM_012072.2| Homo sapiens complement component 1, q
subcomponent, receptor 1 (C1QR1), mRNA
Identities = 22/23 (95%) 

Query: 208 ctgctgctgcttctgctgctcct 230 
I I 1 I I I I I I I I I I I I I I I I I I I 

Sbjct: 170 ctgctgctgctgctgctgctcct 192 
CDS 149.. 2107 

>gi|4809281|ref|NM_001718.2| Homo sapiens bone morphogenetic protein 6 (BMP6),
Identities = 22/23 (95%) 

Query: 208 ctgctgctgcttctgctgctcct 230 

I I I I I I I I I I I I I I I I I I I I I I 
Sbjct: 533 ctgctgctgctgctgctgctcct 511 

CDS 180.. 1721 

/note="Vg-related sequence; transforming growth factor-beta" 

>gi|14777259|ref|XM_027568.1| Homo sapiens similar to interleukin 9 receptor
(LOC146316), mRNA
Identities = 22/23 (95%) 

Query: 208 ctgctgctgcttctgctgctcct 230 
I I I I I I I I I I I I I I I I I I I I I I 

Sbjct: 1294 ctgctgctgctgctgctgctcct 1272 
CDS 660.. 1547 

>gi|2626738|dbj|AB005060.1| Homo sapiens mRNA for NTAK, complete cds
Identities = 22/23 (95%) 

Query: 208 ctgctgctgcttctgctgctcct 230 
I I 1 I I I I I I I I I I I I I I I I I I I 

Sbjct: 341 ctgctgctgctgctgctgctcct 319 
CDS 226.. 2778 

>gi|4507106|ref|NM_003086.1| Homo sapiens small nuclear RNA activating
complex, polypeptide 4,90kDa (SNAPC4), mRNA
Identities = 21/22 (95%) 

Query: 208 ctgctgctgcttctgctgctcc 229 

I I I I I I I I 1 I I I I I I I I I I I I 
Sbjct: 1994 ctgctgctgctgctgctgctcc 1973 

Identities = 27/29 (93%), Gaps = 1/29 (3%) 

Query: 200  tcctcctc-ctgctgctgcttctgctgct 227
            |||||||| ||||||||||| ||||||||
Sbjct: 2009 tcctcctcgctgctgctgctgctgctgct 1981
CDS 376..4785

>gi|21237798|ref|NM_139205.1| Homo sapiens histone deacetylase 5 (HDAC5),
transcript variant 2, 
Identities = 21/22 (95%) 

Query: 208 ctgctgctgcttctgctgctcc 229 

I I I I I I I I I I I I I I I I I 1 I I I 
Sbjct: 1888 ctgctgctgcttctgcttctcc 1867 
Identities = 15/15 (100%) 
Query: 205 ctcctgctgctgctt 219

I I I I I I I I I ! I I I I I 
Sbjct: 670 ctcctgctgctgctt 656 

CDS 305..3418

/note="isoform 2 is encoded by transcript variant 2; antigen NY-CO-9"

>gi|2564750|gb|AF029308.1|HTCRBCHR9 Homo sapiens chromosome 9 duplication of
the T cell receptor beta locus and trypsinogen gene families 
Identities = 19/19 (100%) 

Query: 209 tgctgctgcttctgctgct 227 
I I I I I I I I I I I I I I I I I I I 

Sbjct: 11391 tgctgctgcttctgctgct 11409 
V_segment join(11390. .11420 



>gi|1296750|emb|Z49234.1|HSTCRB2X2 H. sapiens gene for T-cell receptor TCRBV2.2
Identities = 19/19 (100%) 

Query: 209 tgctgctgcttctgctgct 227 
I I I I I I I I I I I I I I I I I I I 

Sbjct: 976 tgctgctgcttctgctgct 994 
CDS 975..>1141 

>gi|13027808|ref|NM_022718.1| Homo sapiens matrix metalloproteinase 25
(MMP25), transcript variant 2, mRNA
Identities = 18/18 (100%) 

Query: 210 gctgctgcttctgctgct 227 
I I I I I I I I I I I I I I I I I I 

Sbjct: 264 gctgctgcttctgctgct 281 
CDS 238.. 1926 

>gi|12803104|gb|BC002356.1|BC002356 Homo sapiens, nucleobindin 1, clone
Identities = 21/22 (95%) 

Query: 213 gctgcttctgctgctcct 230
           |||||| |||||||||||
Sbjct: 68  gctgctgctgctgctcct 85

Query: 208 ctgctgctgcttctgctgctcc 229 
I I I I I I I I I I I I I I I I I I I I I 

Sbjct: 1235 ctgctgctgcttccgctgctcc 1214 
CDS 27.. 1412 

>gi|24496766|ref|NM_004712.3| Homo sapiens hepatocyte growth factor-regulated
tyrosine kinase substrate (HGS), mRNA
Identities = 15/15 (100%) 

Query: 215 tgcttctgctgctcc 229 

I I I I I I I I I I I I I I I 
Sbjct: 1732 tgcttctgctgctcc 1718 
CDS 78.. 2411 

>gi|20544115|ref|XM_059933.5| Homo sapiens similar to putative
lysophosphatidic acid acyltransferase (LOC137964), mRNA
Identities = 15/15 (100%) 

Query: 216 gcttctgctgctcct 230 
I I I I I I I I I I I I I I I 

Sbjct: 2233 gcttctgctgctcct 2219
CDS 913..2283



>gi|22770666|gb|AF531299.1| Homo sapiens histone H1 (HIST1H1A) gene, complete
Identities = 15/15 (100%) 

Query: 216 gcttctgctgctcct 230 

I I I I I I I I I I I I I I I 
Sbjct: 551 gcttctgctgctcct 565 
CDS 521.. 1168 

>gi|18390058|gb|AF463518.1| Homo sapiens cell recognition protein CASPR4
(CASPR4) mRNA, complete
Identities = 15/15 (100%) 

Query: 216 gcttctgctgctcct 230 
I I I I I I I I I I I I I I I 

Sbjct: 1568 gcttctgctgctcct 1582 

CDS 140..4075

/product="cell recognition protein CASPR4"

>gi|21359973|ref|NM_053276.2| Homo sapiens vitrin (VIT), mRNA
Identities = 19/20 (95%) 

Query: 216 cttctgctgctcctaccacc 236
           ||||||||||| ||||||||
Sbjct: 835 cttctgctgcttctaccacc 854
CDS 222..2303

>gi|23308602|ref|NM_015460.1| Homo sapiens myosin VIIA and Rab interacting
protein (MYRIP), mRNA
Identities = 15/15 (100%) 

Query: 220 ctgctgctcctacca 234 

I I I I I I I I I I I I I I I 
Sbjct: 409 ctgctgctcctacca 423 

CDS 137.. 2716 

/product="myosin VIIA and Rab interacting protein" 

>gi|15559110|gb|AF414087.1| Homo sapiens small breast epithelial mucin mRNA,
complete cds

Identities = 16/16 (100%) 

Query: 220 ctgctgctcctaccac 235
           ||||||||||||||||
Sbjct: 210 ctgctgctcctaccac 225
CDS 47..319

/note="SBEM; secreted protein; similar to Mus musculus

>gi|22779205|dbj|AB083783.1| Homo sapiens slac2-c mRNA for Slp homologue
lacking C2 domains-c, complete cds
Identities = 15/15 (100%) 

Query: 220 ctgctgctcctacca 234 
I I I I I I I I I I I I I I I 

Sbjct: 273 ctgctgctcctacca 287 
CDS 1..2580 

>gi|7705267|ref|NM_016255.1| Homo sapiens family with sequence similarity 8,
member A1 (FAM8A1),
Identities = 15/15 (100%) 






Query: 226 ctcctaccaccgccg 240 
I I 1 I I I I I I I I I I I I 

Sbjct: 147 ctcctaccaccgccg 161 
CDS 56.. 1297 

/product="Autosomal Highly Conserved Protein" 

>gi|14210509|ref|NM_032496.1| Homo sapiens rho-gtpase activating protein
ARHGAP9 (ARHGAP9), mRNA
Identities = 15/15 (100%) 

Query: 240 gcctctggtgccacc 254 
I I I I I I I I I I I I I I I 

Sbjct: 2335 gcctctggtgccacc 2349 
CDS 407..2659

/product="hypothetical protein MGC12959" 

>gi|184756|gb|J00221.1|HUMIGCD7 Human Ig germline H-chain G-E-A region B
alpha-2 A2m(1) allele constant region, 3' end
Identities = 15/15 (100%) 

Query: 240 gcctctggtgccacc 254 

I I I I I I I I I I I I I I I 
Sbjct: 800 gcctctggtgccacc 814 

CDS join(<164..469,684..1004,1227..1621)

/product="immunoglobulin alpha-2 heavy chain" 

>gi|21361375|ref|NM_007165.2| Homo sapiens splicing factor 3a, subunit 2,
66kDa (SF3A2), mRNA
Identities = 16/16 (100%) 

Query: 248 tgccaccgcccccgcc 263 
I I I I I I I I I I I I I I I I 

Sbjct: 876 tgccaccgcccccgcc 891 

CDS 125.. 1519 

/note="Spliceosome protein SAP-62; splicing factor 3a, 
subunit 2, 66kD" 

>gi|21361291|ref|NM_005611.2| Homo sapiens retinoblastoma-like 2 (p130)
(RBL2), mRNA

Identities = 15/15 (100%) 

Query: 249 gccaccgcccccgcc 263 

I I I I I I I I I I I I I I I 
Sbjct: 93 gccaccgcccccgcc 107 
CDS 70.. 3489 

>gi|4505322|ref|NM_003941.1| Homo sapiens Wiskott-Aldrich syndrome-like
(WASL), mRNA

Identities = 15/15 (100%) 

Query: 249 gccaccgcccccgcc 263 

I I I I I I I I I I I I I I I 
Sbjct: 1406 gccaccgcccccgcc 1420 
CDS 255.. 1772 

>gi|5901937|ref|NM_007021.1| Homo sapiens decidual protein induced by
progesterone (DEPP), mRNA 
Identities = 15/15 (100%) 



Query: 249 gccaccgcccccgcc 263 

I I I I I I I I I I I I I I I 
Sbjct: 787 gccaccgcccccgcc 801 
CDS 219..857

>gi|22045278|ref|XM_001334.5| Homo sapiens POU domain, class 3, transcription
factor 1 (POU3F1), 
Identities = 16/16 (100%) 

Query: 250 ccaccgcccccgccgg 265 

I I I I I I I I I I I I I I I I 
Sbjct: 1326 ccaccgcccccgccgg 1341 

CDS 36..1391

/product="similar to Octamer-binding transcription factor 6 (OCT-6) (POU-domain
transcription factor SCIP) (TST-1)"

>gi|24234749|ref|NM_012218.2| Homo sapiens interleukin enhancer binding factor
3, 90kDa (ILF3), transcript variant 1, mRNA 
Identities = 15/15 (100%) 

Query: 250 ccaccgcccccgccg 264 

I I I I I I I I I I I I I I I 
Sbjct: 2389 ccaccgcccccgccg 2375 

CDS 267..2951

/note="isoform a is encoded by transcript variant 1; double-stranded RNA-binding
protein, 76 kD; M-phase phosphoprotein 4; nuclear factor associated with dsRNA;
nuclear factor of activated T-cells, 90 kD; translational control protein 80"



>gi|190749|gb|M96684.1|HUMPURA H. sapiens Pur (pur-alpha) mRNA, complete cds
Identities = 16/16 (100%)



Query: 250 ccaccgcccccgccgg 265 

I I I I I I I I I I I I I I I I 
Sbjct: 163 ccaccgcccccgccgg 148 

CDS 60..1028

/function="sequence-specific single-stranded DNA binding protein"

>gi|4885128|ref|NM_005194.1| Homo sapiens CCAAT/enhancer binding protein
(C/EBP), beta (CEBPB),
Identities = 15/15 (100%) 



Query: 250 ccaccgcccccgccg 264 
I I I I I I I I I I I I I I I 

Sbjct: 484 ccaccgcccccgccg 498 
CDS 1..1038 



>gi|24850134|ref|NM_170695.1| Homo sapiens TGFB-induced factor (TALE family
homeobox) (TGIF),
Identities = 18/19 (94%) 



Query: 253 ccgcccccgccggtgcccg 271 
I I I I I I I I I I I I I I I I I I 

Sbjct: 374 ccgcccccgccggagcccg 356 
CDS 304.. 1509 



>gi|20805946|gb|AY083269.1| Homo sapiens transcription factor mammalian MafA
gene, complete cds 
Identities = 18/19 (94%) 
Query: 253 ccgcccccgccggtgcccg 271
I I I I I I I I I I I I I I I I I I 

Sbjct: 236 ccgccgccgccggtgcccg 218 
CDS 1..1059 

>gi|5058992|gb|U66095.1|U66095 Homo sapiens cell-line THP-1 GTP cyclohydrolase 
I mRNA, complete 
Identities = 15/15 (100%) 

Query: 257 ccccgccggtgcccg 271 
I I I I I I I I I I I I I I I 

Sbjct: 177 ccccgccggtgcccg 163 
CDS 145.. 846 

>gi|4507750|ref|NM_001071.1| Homo sapiens thymidylate synthetase (TYMS), mRNA
Identities = 15/15 (100%) 

Query: 261 gccggtgcccgtgcg 275 
I I 1 I I I I I i I I I I I I 

Sbjct: 267 gccggtgcccgtgcg 253 
CDS 106.. 1047 

>gi|17511946|gb|BC018929.1|BC018929 Homo sapiens, Similar to T-cell death 
associated gene, clone 
Identities = 15/15 (100%) 

Query: 261 gccggtgcccgtgcg 275 
I I I I I I I I I I I I I I I 

Sbjct: 1029 gccggtgcccgtgcg 1015 
CDS 279.. 1058 

>gi|6679302|ref|NM_007350.1| Homo sapiens pleckstrin homology-like domain,
family A, member 1 (PHLDA1), mRNA
Identities = 15/15 (100%) 

Query: 261 gccggtgcccgtgcg 275 
I I I I I I I I I I I I I I I 

Sbjct: 1333 gccggtgcccgtgcg 1319 
CDS 160.. 1362 

>gi|23097243|ref|NM_152891.1| Homo sapiens serine protease EOS (EOS), mRNA
Identities = 19/20 (95%) 

Query: 263 cggtgcccgtgcgaccggtg 282 
I I I I I I I I I I I I I I I I I I I 

Sbjct: 385 cggtgcccgtgcgacgggtg 404 
CDS 69..923

>gi|20555609|ref|XM_165720.1| Homo sapiens HCR (a-helix coiled-coil rod
homologue) (HCR), mRNA 
Identities = 15/15 (100%) 

Query: 275 gaccggtggtggtag 289 

I I I I I I I I I I I I I I I 
Sbjct: 1573 gaccggtggtggtag 1559 
CDS 80.. 2428 

>gi|10947055|ref|NM_020987.1| Homo sapiens ankyrin 3, node of Ranvier (ankyrin
G) (ANK3), transcript variant 1, mRNA 
Identities = 21/21 (100%) 
Query: 280 gtggtggtagtggtggtggtg 300

I I I I I 1 I I I I I I I I I I 1 I I I I 
Sbjct: 12113 gtggtggtagtggtggtggtg 12093 



Identities = 21/22 (95%)

Query: 279 ggtggtggtagtggtggtggtg 300 

I I I I I I I I I I I I I I I I I I I I I 

Sbjct: 12096 ggtggtggcagtggtggtggtg 12075

Identities = 18/18 (100%) 

Query: 279 ggtggtggtagtggtggt 296 
I I I I I I I I I I I I I I I I I I 

Sbjct: 12123 ggtggtggtagtggtggt 12106 

Identities = 20/22 (90%) 

Query: 279 ggtggtggtagtggtggtggtg 300 
I I I I I I II I I I I I I I I I I I I 

Sbjct: 12111 ggtggtagtggtggtggtggtg 12090 

Identities = 19/21 (90%) 

Query: 279   ggtggtggtagtggtggtggt 299
             ||||||||| || ||||||||
Sbjct: 12126 ggtggtggtggtagtggtggt 12106

Identities = 13/13 (100%) 

Query: 279 ggtggtggtagtg 291 
I I I I I I I I I I i I I 

Sbjct: 12081 ggtggtggtagtg 12069 
CDS 193..13326

>gi|7662227|ref|NM_014841.1| Homo sapiens synaptosomal-associated protein,
91kDa homolog (mouse) (SNAP91), mRNA
Identities = 20/20 (100%) 

Query: 279 ggtggtggtagtggtggtgg 298 
I I I I I I I I I I I I I I I I I I I I 

Sbjct: 1890 ggtggtggtagtggtggtgg 1871 
CDS 244..2967

Identities = 14/14 (100%) 

Query: 279 ggtggtggtagtgg 292 
I I I I I I I I I I I I I I 

Sbjct: 717 ggtggtggtagtgg 704 

Identities = 19/21 (90%) 

Query: 279 ggtggtggtagtggtggtggt 299 
I I I I I I II I II I I I I I I I I 

Sbjct: 729 ggtggtagtggtggtggtggt 709 
CDS 320..1168

>gi|5453935|ref|NM_006236.1| Homo sapiens POU domain, class 3, transcription
factor 3 (POU3F3),
Identities = 21/22 (95%) 

Query: 279 ggtggtggtagtggtggtggtg 300 
I I I I I I I I I I I I I I I I I II I I 

Sbjct: 829 ggtggtggtggtggtggtggtg 808 
CDS 1..1503 

>gi|21396478|ref|NM_005924.2| Homo sapiens mesenchyme homeo box 2 (growth
arrest-specific homeo box) (MEOX2), mRNA

Identities = 21/22 (95%) 

Query: 279 ggtggtggtagtggtggtggtg 300 

I I I I I I I I I I I I I I I I I I I I I 
Sbjct: 404 ggtggtggtggtggtggtggtg 383 

Identities = 21/22 (95%) 

Query: 279 ggtggtggtagtggtggtggtg 300 
I I I I I I I I I I I I I I I I I I I I I 

Sbjct: 407 ggtggtggtggtggtggtggtg 386 

Identities = 20/22 (90%) 

Query: 279 ggtggtggtagtggtggtggtg 300 
I I I I I I I I I I I I I I I I I I I I 

Sbjct: 416 ggtggtgatggtggtggtggtg 395 
CDS 182.. 1093 

>gi|21361336|ref|NM_001969.2| Homo sapiens eukaryotic translation initiation
factor 5 (EIF5), mRNA 
Identities = 21/22 (95%) 

Query: 279 ggtggtggtagtggtggtggtg 300 
I I I I I I I I I I I I I I I I I I I I I 

Sbjct: 1022 ggtggtggtggtggtggtggtg 1001 

Identities = 19/20 (95%) 

Query: 281 tggtggtagtggtggtggtg 300 
I I I I I I I I I I I I I I I I I I I 

Sbjct: 1023 tggtggtggtggtggtggtg 1004 
CDS 469.. 1764 

>gi|12597624|ref|NM_012068.2| Homo sapiens activating transcription factor 5
(ATF5), mRNA 

Identities = 20/20 (100%) 

Query: 281 tggtggtagtggtggtggtg 300 
I I I I I I I I I I I I I I I I I I I I 

Sbjct: 730 tggtggtagtggtggtggtg 711 

>gi|20127494|ref|NM_006237.2| Homo sapiens POU domain, class 4, transcription
factor 1 (POU4F1),
Identities = 21/22 (95%) 

Query: 279 ggtggtggtagtggtggtggtg 300 
I I I I I I I I I I I I I I I I I I I I I 

Sbjct: 553 ggtggtggtggtggtggtggtg 532

Identities = 21/22 (95%)



Query: 279 ggtggtggtagtggtggtggtg 300 
I I I I I I I I I I I I I I I I I I I I I 

Sbjct: 556 ggtggtggtggtggtggtggtg 535 

Identities = 19/20 (95%) 

Query: 281 tggtggtagtggtggtggtg 300 

I I I I I I I I I I I I I I I I I I I 
Sbjct: 560 tggtggtggtggtggtggtg 541

CDS 235..1497

>gi|20379115|gb|AF498971.1| Homo sapiens small GTP binding protein RhoB (ARHB)
mRNA, complete 
Identities = 15/15 (100%) 

Query: 299 tggtggtggtgggcg 313 
I I I I I I I I I I I I I I I 

Sbjct: 23 tggtggtggtgggcg 37 
CDS 1..591 

>gi|190939|gb|M38453.1|HUMRASTG Human ras transforming protein gene, exon 1
Identities = 15/15 (100%) 

Query: 299 tggtggtggtgggcg 313 
I I I I I I I I I I I I I 1 I 

Sbjct: 149 tggtggtggtgggcg 163
exon 133..243

>gi|20544140|ref|NM_003185.2| Homo sapiens TAF4 RNA polymerase II, TATA box
binding protein (TBP)-associated factor, 135kDa (TAF4), mRNA
Identities = 14/14 (100%) 

Query: 299 tggtggtggtgggc 312 
I I I I I I I I I I I I I I 

Sbjct: 125 tggtggtggtgggc 112 
CDS 1..3252 

>gi|21361862|ref|NM_033104.2| Homo sapiens stonin 2 (STN2), mRNA
Identities = 14/14 (100%) 

Query: 299 tggtggtggtgggc 312 

I I I I I I I I I I I I I I 
Sbjct: 2110 tggtggtggtgggc 2097 

CDS 202.. 2919 

>gi|22597105|gb|AF521671.1| Homo sapiens SWI/SNF chromatin remodeling complex 
subunit OSA2 

Identities = 14/14 (100%) 

Query: 299 tggtggtggtgggc 312 
I I I I I I I I I I I I I I 

Sbjct: 68 tggtggtggtgggc 55 
CDS <1..6498

>gi|22051956|ref|XM_113625.2| Homo sapiens similar to Anthrax toxin receptor
precursor (Tumor endothelial marker 8) (LOC195977), mRNA
Identities = 14/14 (100%) 

Query: 299 tggtggtggtgggc 312 

I I I I I I I I i I I I I I 
Sbjct: 411 tggtggtggtgggc 398
CDS 251..1171



>gi|17863992|gb|AF449430.1|AF449430 Homo sapiens endocytosis protein HSTNB 
variant mRNA, complete cds 
Identities = 14/14 (100%) 

Query: 299 tggtggtggtgggc 312 

I I I I I I I I I I I I I I 
Sbjct: 2110 tggtggtggtgggc 2097 

CDS 202..2919

>gi|11065969|gb|AF193855.1|AF193855 Homo sapiens zinc finger protein of
cerebellum ZIC2 (ZIC2) mRNA, 
Identities = 14/14 (100%) 

Query: 299 tggtggtggtgggc 312 
I I I I I I I I I I I I I I 

Sbjct: 701 tggtggtggtgggc 688 
CDS 1..1599 

>gi|17474021|ref|XM_058523.1| Homo sapiens similar to MDM2 variant FB29
(LOC121015), mRNA
Identities = 14/14 (100%)

Query: 302 tggtggtgggcggg 315
           ||||||||||||||
Sbjct: 117 tggtggtgggcggg 104
CDS 26..418

>gi|21361620|ref|NM_002633.2| Homo sapiens phosphoglucomutase 1 (PGM1), mRNA
Identities = 14/14 (100%)

Query: 302 tggtggtgggcggg 315
           ||||||||||||||
Sbjct: 383 tggtggtgggcggg 396
CDS 214..1902

>gi|6979929|gb|AF221759.1|AF221759 Homo sapiens Maml mRNA, partial cds
Identities = 14/14 (100%)

Query: 303  ggtggtgggcgggt 316
            ||||||||||||||
Sbjct: 1643 ggtggtgggcgggt 1630
CDS <1..2682

>gi|4505736|ref|NM_002621.1| Homo sapiens properdin P factor, complement
(PFC), mRNA
Identities = 14/14 (100%)

Query: 306  ggtgggcgggtact 320
            ||||||||||||||
Sbjct: 1493 ggtgggcgggtact 1480
CDS 243..1652

>gi|20127484|ref|NM_005654.2| Homo sapiens nuclear receptor subfamily 2, group
F, member 1 (NR2F1), mRNA
Identities = 14/14 (100%)

Query: 319 tagcgcgacgtggg 332
           ||||||||||||||
Sbjct: 705 tagcgcgacgtggg 692
CDS 98..1369

/note="Transcription factor COUP 1 (chicken ovalbumin upstream promoter 1, ;
transcription factor COUP 1"



>gi|16418382|ref|NM_052876.1| Homo sapiens transcriptional repressor NAC1
(NAC1), mRNA
Identities = 15/15 (100%)

Query: 326 acgtgggcgaccagt 340
           |||||||||||||||
Sbjct: 404 acgtgggcgaccagt 418

CDS 127..1710

/note="contains POZ domain"

>gi|16445431|ref|NM_033662.1| Homo sapiens WD repeat domain 4 (WDR4),
transcript variant 3, mRNA
Identities = 16/16 (100%)

Query: 328 gtgggcgaccagtggc 343
           ||||||||||||||||
Sbjct: 882 gtgggcgaccagtggc 897

CDS 363..1163

/note="isoform 2 is encoded by transcript variant 3; WD repeat-containing
protein 4"



>gi|22027497|ref|NM_012295.2| Homo sapiens calcineurin binding protein 1
(CABIN1), mRNA
Identities = 15/15 (100%)

Query: 334 gaccagtggctgctg 348
           |||||||||||||||
Sbjct: 1787 gaccagtggctgctg 1801
CDS 128..6790



>gi|20336259|ref|NM_015866.2| Homo sapiens PR domain containing 2, with ZNF
domain (PRDM2), transcript variant 2, mRNA
Identities = 14/14 (100%) 

Query: 336 ccagtggctgctgg 349 

I I I I I I I I I I I I I I 
Sbjct: 3329 ccagtggctgctgg 3316 

CDS 857.. 5905 

/note="isoform b is encoded by transcript variant 2; zinc-finger DNA-binding 
protein; retinoblastoma protein-interacting zinc finger protein; MTE-binding 
protein" 



>gi|22042322|ref|XMJ3154 28.3| Homo sapiens similar to CGI-105 protein
(LOC151313), mRNA
Identities = 14/14 (100%)

Query: 337 cagtggctgctggg 350
           ||||||||||||||
Sbjct: 620 cagtggctgctggg 633
CDS 11..955

>gi|20270612|ref|NG_001318.1| Homo sapiens HSP40 pseudogene (HSP40)
Identities = 14/14 (100%)

Query: 337 cagtggctgctggg 350
           ||||||||||||||
Sbjct: 2499 cagtggctgctggg 2486
misc_feature 1..2882
>gi|21265045|ref|NM_139027.1| Homo sapiens a disintegrin-like and
metalloprotease (reprolysin type) with thrombospondin type 1 motif, 13
(ADAMTS13), transcript variant 2, mRNA
Identities = 17/17 (100%)

Query: 339 gtggctgctgggctggg 355
           |||||||||||||||||
Sbjct: 3812 gtggctgctgggctggg 3828
CDS 445..4560

(vWF)-cleaving protease, which is responsible for cleaving at the

>gi|22050832|ref|XM_114863.2| Homo sapiens similar to alpha 2 type IV collagen
preproprotein; canstatin (LOC203630), mRNA
Identities = 16/16 (100%) 

Query: 339 gtggctgctgggctgg 354 

I I I I I I I I I I I I I I I I 
Sbjct: 450 gtggctgctgggctgg 465 

CDS 1..1425 
/product="similar to alpha 2 type IV collagen preproprotein; canstatin" 

>gi|22064435|ref|XM_017037.3| Homo sapiens suppressor of Ty 6 homolog (S.
cerevisiae) (SUPT6H),
Identities = 15/15 (100%) 

Query: 340 tggctgctgggctgg 354 
I I I I I I ! I I I I I I I I 

Sbjct: 4624 tggctgctgggctgg 4610 
CDS 161.. 4972 

>gi|20149786|gb|AF039196.3| Homo sapiens putative single zinc finger 
transcription factor protein (hairless) mRNA, complete cds 
Identities = 15/15 (100%) 

Query: 341 ggctgctgggctggg 355 

I I I I I I I I I I I I I I I 
Sbjct: 4218 ggctgctgggctggg 4204 

CDS 1485.. 5054 

/note="restricted expression in the brain and skin" 
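
The header line of each hit follows the pipe-delimited form `>gi|number|database|accession.version|locus description`. As a rough sketch, assuming a wrapped header has first been joined into one line and that the pipes are intact, the fields can be pulled apart with a regular expression. `parse_defline` is a hypothetical helper name, not a BLAST or NCBI API.

```python
import re

# gi number, source database tag, accession.version, optional locus, description
DEFLINE = re.compile(r"^>gi\|(\d+)\|([a-z]+)\|([^|\s]+)\|(\S*)\s*(.*)$")


def parse_defline(line: str) -> dict[str, str]:
    """Split a single-line 'gi' header into its named fields."""
    match = DEFLINE.match(line.strip())
    if match is None:
        raise ValueError(f"not a gi-style header: {line!r}")
    gi, database, accession, locus, description = match.groups()
    return {
        "gi": gi,
        "database": database,
        "accession": accession,
        "locus": locus,  # empty when the header has no locus name
        "description": description,
    }


header = (">gi|20149786|gb|AF039196.3| Homo sapiens putative single zinc finger "
          "transcription factor protein (hairless) mRNA, complete cds")
print(parse_defline(header)["accession"])
```

Headers damaged by extraction (for example with `I` or `j` in place of `|`) will not match and raise `ValueError`, which makes them easy to flag for manual repair.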

>gi|20070162|ref|NM_018896.2| Homo sapiens calcium channel, voltage-dependent,
alpha 1G subunit (CACNA1G), mRNA
Identities = 15/15 (100%) 

Query: 342 gctgctgggctgggt 356 
I I I I I I I I I I I I I I I 

Sbjct: 1440 gctgctgggctgggt 1426 
CDS 1..7134 

>gi|18564486|ref|XM_094865.1| Homo sapiens similar to Olfactory receptor 4F3
(LOC168119), mRNA
Identities = 15/15 (100%)

Query: 343 ctgctgggctgggtt 357
           |||||||||||||||
Sbjct: 49 ctgctgggctgggtt 35
CDS 1..978

>gi|4503532|ref|NM_001417.1| Homo sapiens eukaryotic translation initiation
factor 4B (EIF4B),
Identities = 16/16 (100%)

Query: 350 gctgggttcacgtggt 365
           ||||||||||||||||
Sbjct: 393 gctgggttcacgtggt 378
CDS 1..1836

>gi|4557252|ref|NM_001109.1| Homo sapiens a disintegrin and metalloproteinase
domain 8 (ADAM8),
Identities = 15/15 (100%)

Query: 360 cgtggtggtggttct 374
           |||||||||||||||
Sbjct: 1980 cgtggtggtggttct 1994
CDS 10..2484

>gi|20143921|ref|NM_133437.1| Homo sapiens titin (TTN), transcript variant
novex-2, mRNA
Identities = 17/18 (94%)

Query: 360 cgtggtggtggttctcca 377
           |||| |||||||||||||
Sbjct: 33611 cgtgatggtggttctcca 33628
CDS 224..81580
/note="isoform novex-2 is encoded by transcript variant novex-2; connectin;
CMH9, included"

>gi|22726242|gb|BC037404.1| Homo sapiens, Similar to formin binding protein 4,
clone MGC:36749
Identities = 14/14 (100%) 

Query: 361 gtggtggtggttct 374 

I i I I I I I I ! I I i I I 
Sbjct: 2743 gtggtggtggttct 2730 
CDS 28.. 3075 

>gi|4557234|ref|NM_000018.1| Homo sapiens acyl-Coenzyme A dehydrogenase, very
long chain (ACADVL), nuclear gene encoding mitochondrial protein, mRNA
Identities = 14/14 (100%)

Query: 362 tggtggtggttctc 375 
I I I I I I I 1 I I I I I I 

Sbjct: 1818 tggtggtggttctc 1831 
CDS 86.. 2053 

>gi|22048275|ref|XM_063346.3| Homo sapiens similar to polypyrimidine-tract
binding protein (LOC122888), mRNA
Identities = 14/14 (100%) 

Query: 363 ggtggtggttctcc 376 

I I I I I I I I I I I I I I 
Sbjct: 1345 ggtggtggttctcc 1332 
CDS 1..1461 

>gi|4885524|ref|NM_005490.1| Homo sapiens SH2 domain containing 3A (SH2D3A),
Identities = 15/15 (100%)

Query: 390 ctgcgcgctcctcca 404
           |||||||||||||||
Sbjct: 1319 ctgcgcgctcctcca 1305
CDS 152..1882
>gi|22060841|ref|XM_165659.2| Homo sapiens elaC homolog 1 (E. coli) (ELAC1),
mRNA
Identities = 16/16 (100%)

Query: 397 ctcctccatcacccac 412
           ||||||||||||||||
Sbjct: 894 ctcctccatcacccac 879
CDS 108..1199
/product="similar to elaC homolog 1 (E. coli); similar to Escherichia coli elaC;
hypothetical protein D29; elaC



>gi|9055315|ref|NM_012406.2| Homo sapiens PR domain containing 4 (PRDM4), mRNA
Identities = 15/15 (100%) 



Query: 397 ctcctccatcaccca 411 
I I I I I I I I I I I I I I I 

Sbjct: 1109 ctcctccatcaccca 1123 
CDS 123.. 2528 



>gi|22046620|ref|XM_069728.3| Homo sapiens similar to beta-glucuronidase
(LOC136132), mRNA
Identities = 15/15 (100%) 



Query: 399 tcctccatcacccac 413 
I I I I I I I I I I I I I I I 

Sbjct: 663 tcctccatcacccac 649 
CDS 1..972 



>gi|19263734|gb|BC025358.1| Homo sapiens, ATP-binding cassette, sub-family D
(ALD), member 1,
Identities = 15/15 (100%) 

Query: 400 ctccatcacccaccg 414 
I I I I I I I I I I I I I I I 

Sbjct: 2364 ctccatcacccaccg 2378 
CDS 400.. 2637 

>gi|11968022|ref|NM_022473.1| Homo sapiens zinc finger protein 106 (ZFP106),
mRNA 

Identities = 15/15 (100%) 



Query: 400 ctccatcacccaccg 414 

I I I I I I I I I I I I I I I 
Sbjct: 3838 ctccatcacccaccg 3852 

CDS 336.. 5987 

>gi|18586624|ref|XM_085530.1| Homo sapiens similar to Adrenoleukodystrophy
protein (ALDP) 
Identities = 15/15 (100%) 

Query: 400 ctccatcacccaccg 414 

I I I I I I I I I I I I I I I 
Sbjct: 310 ctccatcacccaccg 324 

CDS 134 . . 583 

misc_feature 161.. 376 

>gi|19743876|ref|NM_002918.2| Homo sapiens regulatory factor X, 1 (influences
HLA class II expression) (RFX1), mRNA
Identities = 15/15 (100%)



Query: 406 cacccaccgctgctg 420 
I I I I I I I I I I I I I I I 

Sbjct: 220 cacccaccgctgctg 234 
CDS 93.. 3032 



>gi|19263734|gb|BC025358.1| Homo sapiens, ATP-binding cassette, sub-family D
(ALD), member 1,
Identities = 15/15 (100%) 
Query: 410 ctccatcacccaccg 424 

I I I I I I I I I I I I I I I 
Sbjct: 2364 ctccatcacccaccg 2378 
CDS 400.. 2637 

>gi|18586624|ref|XM_085530.1| Homo sapiens similar to Adrenoleukodystrophy
protein (ALDP) (LOC146640), mRNA 
Identities = 15/15 (100%) 

Query: 410 ctccatcacccaccg 424 
I I I I I I I I I I I I I I I 

Sbjct: 310 ctccatcacccaccg 324 
CDS 134.. 583 

>gi|11968022|ref|NM_022473.1| Homo sapiens zinc finger protein 106 (ZFP106),
mRNA 

Identities = 15/15 (100%) 

Query: 410 ctccatcacccaccg 424 

I I I I I I I I I I I! I I I 
Sbjct: 3838 ctccatcacccaccg 3852 
CDS 336.. 5987 

>gi|13184045|ref|NM_023944.1| Homo sapiens cytochrome P450, subfamily IVF,
polypeptide 12 (CYP4F12), mRNA 
Identities = 17/17 (100%) 

Query: 413 cgctgctgagcctgccc 429 
I I I I I I I I I I I I I I I I I 

Sbjct: 35 cgctgctgagcctgccc 51 
CDS 31 . .1605 

>gi|23821028|ref|NM_153486.1| Homo sapiens lactate dehydrogenase D (LDHD),
mRNA 

Identities = 20/22 (90%) 

Query: 424 ctgcccgacgcgcggctcctgc 445 
I I I I 1 1 I I I I I I I I I I I I I I 

Sbjct: 637 ctgcccgacgggcggctgctgc 658 
CDS 58.. 1518 
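
The `Identities = matches/length (percent%)` line can be recomputed from the two aligned strings. The sketch below assumes an ungapped alignment of equal-length strings, as in every hit shown here, and uses integer truncation for the percentage, which is consistent with the values printed in this listing (for example 20/22 shown as 90%). `identity_summary` is a hypothetical helper name.

```python
def identity_summary(query: str, subject: str) -> tuple[int, int, int]:
    """Return (matches, alignment length, truncated percent identity)."""
    if len(query) != len(subject):
        raise ValueError("aligned strings must have the same length")
    # Count positions where both strands carry the same base.
    matches = sum(1 for q, s in zip(query.lower(), subject.lower()) if q == s)
    length = len(query)
    # Integer division truncates, so 20/22 gives 90 rather than 91.
    percent = matches * 100 // length
    return matches, length, percent


matches, length, percent = identity_summary(
    "ctgcccgacgcgcggctcctgc",  # Query 424..445 (LDHD hit)
    "ctgcccgacgggcggctgctgc",  # Sbjct 637..658
)
print(f"Identities = {matches}/{length} ({percent}%)")
```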

>gi|24528580|gb|AF079529.2| Homo sapiens cAMP-specific phosphodiesterase 8B1
(PDE8B) mRNA, complete cds; alternatively spliced
Identities = 15/15 (100%)

Query: 427 cccgacgcgcggctc 441
           |||||||||||||||
Sbjct: 224 cccgacgcgcggctc 210
CDS 46..2703
>gi|22001416|ref|NM_015465.1| Homo sapiens gemin 5 (GEMIN5), mRNA
SMN complex component; Sm-interacting protein; DKFZP586M1824 protein
Identities = 14/14 (100%)

Query: 434 cgcggctcctgccc 447
           ||||||||||||||
Sbjct: 17 cgcggctcctgccc 4
CDS 1..4527

>gi|29459|emb|X68149.1|HSBLR1A Homo sapiens BLR1 gene for Burkitt's lymphoma
receptor 1 

Identities = 18/19 (94%) 

Query: 434 cgcggctcctgcccaagct 452 
I I I I I I I I I I I I I I II I I 

Sbjct: 1085 cgcggctcctgaccaagct 1103 
CDS 85.. 1203 

>gi|840783|emb|X68829.1|HSMDCR H. sapiens mRNA for MDR15 protein
Identities = 18/19 (94%) 

Query: 434 cgcggctcctgcccaagct 452 
I I I I I I I I I I I I I I I I I I 

Sbjct: 1154 cgcggctcctgaccaagct 1172 
CDS 289.. 1272 



>gi|5032094|ref|NM_005630.1| Homo sapiens solute carrier family 21
(prostaglandin transporter), member 2 (SLC21A2), mRNA
Identities = 17/17 (100%) 



Query: 437 ggctcctgcccaagctc 453 

I I I I I I I I I I I I I I I I I 
Sbjct: 88 ggctcctgcccaagctc 104 

CDS 84.. 2015 

>gi|4505876|ref|NM_000445.1| Homo sapiens plectin 1, intermediate filament
binding protein 500kDa (PLEC1), mRNA
Identities = 17/18 (94%)

Query: 438 gctcctgcccaagctcct 455
           |||||||| |||||||||
Sbjct: 7156 gctcctgcgcaagctcct 7139
CDS 52..13776
/function="high molecular weight cytoskeletal-associated protein which is a
component of hemidesmosomes in basal keratinocytes and also a component of the
sarcolemma in muscle (HD1)



>gi|22058106|ref|XM_171754.1| Homo sapiens similar to a disintegrin-like and
metalloprotease (reprolysin type) with thrombospondin type 1 motif, 17 
preproprotein (LOC257018), mRNA 
Identities = 15/15 (100%) 



Query: 438 gctcctgcccaagct 452 

I I I I I I I I I I I I I I I 
Sbjct: 91 gctcctgcccaagct 77 
CDS 1..420



>gi|4505936|ref|NM_002693.1| Homo sapiens polymerase (DNA directed), gamma
(POLG), nuclear gene encoding mitochondrial protein, mRNA
Identities = 14/14 (100%)

Query: 438 gctcctgcccaagc 451
           ||||||||||||||
Sbjct: 1953 gctcctgcccaagc 1966
CDS 283..4002

>gi|533527|gb|U10694.1|HSU10694 Human MAGE-9 antigen (MAGE9) gene, complete
cds
Identities = 18/19 (94%)

Query: 438 gctcctgcccaagctcctg 456
           ||||||||||| |||||||
Sbjct: 1290 gctcctgcccacgctcctg 1308
exon 1268..>2845
CDS 1333..2280

>gi|4758361|ref|NM_004112.1| Homo sapiens fibroblast growth factor 11 (FGF11),
mRNA
Identities = 16/16 (100%)

Query: 442 ctgcccaagctcctgg 457
           ||||||||||||||||
Sbjct: 589 ctgcccaagctcctgg 604
CDS 1..678

>gi|13435128|ref|NM_022089.1| Homo sapiens putative ATPase (HSA9947), mRNA
Identities = 17/18 (94%) 

Query: 442 ctgcccaagctcctggtc 459 
I I I I I I I I I I I I I I I I I 

Sbjct: 2555 ctgcccaaggtcctggtc 2572 
CDS 35.. 3577 

>gi|24308028|ref|NM_015296.1| Homo sapiens zizimin1 (zizimin1), mRNA
Identities = 14/14 (100%)

Query: 445 cccaagctcctggt 458
           ||||||||||||||
Sbjct: 2431 cccaagctcctggt 2418
CDS 56..6265
note="Cdc42 activator"

>gi|22043940|ref|XM_060678.5| Homo sapiens similar to Synaptotagmin II (SytII)
(LOC127833), mRNA 
Identities = 14/14 (100%) 

Query: 445 cccaagctcctggt 458 
I I I I I I I I I I I I I I 

Sbjct: 540 cccaagctcctggt 527 
CDS 1..1278 

>gi|18599586|ref|XM_092362.1| Homo sapiens similar to
evidence:NAS-hypothetical protein-putative (LOC165086), mRNA
Identities = 17/18 (94%) 

Query: 448 aagctcctggtctaggag 465 

I I I I I I I I I I I I I I I I I 
Sbjct: 776 aagctcctggtctgggag 759 
CDS 1 . . 936 

>gi|23336903|tpg|BK000566.1| TPA: Homo sapiens SF3b125 DEAD-box protein mRNA,
complete cds,
Identities = 14/14 (100%)



Query: 452 tcctggtctaggag 465 
I I I I I I I I I I I I I I 

Sbjct: 399 tcctggtctaggag 386 
CDS 1. .2460 

>gi|22052707|ref|XM_172523.1| Homo sapiens similar to N-formyl peptide
receptor (LOC256135), mRNA
Identities = 14/14 (100%)

Query: 460 taggagtaaggcca 473
           ||||||||||||||
Sbjct: 410 taggagtaaggcca 397
CDS 81..560

>gi|474428|emb|Z31702.1|HSP53DN H. sapiens p5-3 DNA
Identities = 16/16 (100%)

Query: 461 aggagtaaggccatgg 476
           ||||||||||||||||
Sbjct: 264 aggagtaaggccatgg 279
1..1464

Non-homologous recombination within the major histocompatibility 
complex creates a transcribed hybrid sequence 

>gi|18550199|ref|XM_059368.2| Homo sapiens similar to thymidylate kinase
family LPS-inducible member; thymidylate kinase homologue (LOC129607), mRNA 
Identities = 14/14 (100%) 

Query: 467 aaggccatgggcgc 480 
I I I I I I I I I I I I I I 

Sbjct: 23 aaggccatgggcgc 10 
CDS 16.. 1365 

>gi | 6706902 | emb| AL109827 . 8 | HSJ309K20 Human DNA sequence from clone RP1-309K20 
on chromosome 20 Contains the gene for a novel protein similar to dysferlin, the 
SPAG4 gene for sperm associated antigen 4, the CPNE1 gene for Copine I (similar 
to KIAA0636), the gene KIAA0765 (HRIHFB2091) for an RNA recognition motif 
( RNP, RRM or RBD domain) containing protein and the 3' end of the NIFS 
gene for cysteine desulfurase. 
Identities = 16/16 (100%) 

Query: 468 aggccatgggcgcggc 483 
I I I I I I I I I I I I I I I I 

Sbjct: 7506 aggccatgggcgcggc 7491 
CDS 7357..7540



>gi|21614532|ref|NM_144957.1| Homo sapiens protease, serine, 21 (testisin)
(PRSS21), transcript variant 3, mRNA
Identities = 14/14 (100%)

Query: 468 aggccatgggcgcg 481
           ||||||||||||||
Sbjct: 102 aggccatgggcgcg 115
CDS 107..1009



>gi|18087852|ref|NM_080672. Homo sapiens Q9H4T4 like (H17739), mRNA
Identities = 18/19 (94%)

Query: 469 ggccatgggcgcggccggc 487
           |||||||||| ||||||||
Sbjct: 141 ggccatgggcccggccggc 123
CDS 136..1815

>gi|790818|gb|L39891.1|HUMPKD1GEN Homo sapiens polycystic kidney disease-
associated protein (PKD1) gene,
Identities = 16/16 (100%)

Query: 475 gggcgcggccggccgc 490
           ||||||||||||||||
Sbjct: 50097 gggcgcggccggccgc 50082
CDS 49997..50171

>gi|24429581|ref|NM_153813.1| Homo sapiens friend of GATA-1 (FOG1), mRNA
Identities = 14/14 (100%)

Query: 475 gggcgcggccggcc 488
           ||||||||||||||
Sbjct: 2503 gggcgcggccggcc 2490
CDS 323..3337

>gi|22094134|ref|NM_032482.1| Homo sapiens histone methyltransferase DOT1L
(DOT1L), mRNA
Identities = 14/14 (100%)

Query: 476 ggcgcggccggccg 489
           ||||||||||||||
Sbjct: 2314 ggcgcggccggccg 2301
CDS 37..4650
function="methylates lysine 79 of histone H3"

>gi I 4759111 | ref | NM_004207 . 1 | Homo sapiens solute carrier family 16 
(monocarboxylic acid transporters), member 3 (SLC16A3), mRNA 
Identities = 14/14 (100%) 

Query: 477 gcgcggccggccgc 490 
I I I I I I I I I I I I I I 

Sbjct: 891 gcgcggccggccgc 904 
CDS 63..1460

>gi|20544351|ref|XM_005702.8| Homo sapiens wingless-type MMTV integration site
family, member 8B (WNT8B), mRNA
Identities = 14/14 (100%)

Query: 477 gcgcggccggccgc 490
           ||||||||||||||
Sbjct: 808 gcgcggccggccgc 821
CDS 129..1184

>gi|23452045|gb|AF494409.1| Homo sapiens pantothenate kinase 2 mRNA, complete
cds 

Identities = 15/15 (100%) 

Query: 480 cggccggccgcctct 494 
I I I I I I I I I 1 I I I I I 

Sbjct: 279 cggccggccgcctct 265 
CDS 7..1719

>gi|2213644|gb|U63833.1|HSU63833 Human PAX6 gene, promoter region and exons 1
and 2
Identities = 16/16 (100%)

Query: 482 gccggccgcctctgct 497
           ||||||||||||||||
Sbjct: 114 gccggccgcctctgct 129
promoter 1..3274

/note="isoform 2 is encoded by transcript variant 2; ortholog of mouse CNR5;
KIAA0345-like 1"

>gi|14589892|ref|NM_001794.2| Homo sapiens cadherin 4, type 1, R-cadherin
(retinal) (CDH4), mRNA
Identities = 15/15 (100%)

Query: 483 ccggccgcctctgct 497
           |||||||||||||||

Sbjct: 2841 ccggccgcctctgct 2827 
CDS 15.. 2765 

This gene is a classical cadherin from the cadherin superfamily. The encoded 
protein is a calcium-dependent cell-cell adhesion glycoprotein, studies 

>gi|11545830|ref|NM_022114.1| Homo sapiens PR domain containing 16 (PRDM16),
mRNA
Identities = 15/15 (100%)

Query: 483 ccggccgcctctgct 497 
I I I I I I I I I I I I I I I 

Sbjct: 1638 ccggccgcctctgct 1652 
gene 1 . .4376 

>gi|14165396|ref|NM_031865.1| Homo sapiens protocadherin alpha 13 (PCDHA13),
transcript variant 2,
Identities = 16/16 (100%) 

Query: 484 cggccgcctctgctgc 499 
I I I I I I I I I I I I I I I I 

Sbjct: 2268 cggccgcctctgctgc 2253 
CDS 1. .2424 

>gi|12697311|emb|AJ295990.1|HSA295990 Homo sapiens mRNA for putative chromatin
modulator, alternative splice NSD3L
Identities = 15/15 (100%)

Query: 485 ggccgcctctgctgc 499
           |||||||||||||||
Sbjct: 4514 ggccgcctctgctgc 4528
CDS 314..4627

/note="alternative splice NSD3L" 

>gi|12642816|gb|AF332469.1|AF332469 Homo sapiens putative protein WHSC1L11
(WHSC1L1) mRNA, complete cds, alternatively spliced 
Identities = 15/15 (100%) 

Query: 485 ggccgcctctgctgc 499 
I I I I I I I I I I I I II I 

Sbjct: 4719 ggccgcctctgctgc 4733 
CDS 519.. 4832 

>gi|17474463|ref|XM_062506.1| Homo sapiens similar to 60S acidic ribosomal
protein P2 (LOC121193), mRNA
Identities = 15/15 (100%)

Query: 485 ggccgcctctgctgc 499
           |||||||||||||||
Sbjct: 278 ggccgcctctgctgc 292
CDS 1..351

product="similar to 60S acidic ribosomal protein P2"

>gi|22035673|ref|NM_006031.2| Homo sapiens pericentrin 2 (kendrin) (PCNT2),
mRNA 

Score =30.2 bits (15), Expect = 88 
Identities = 15/15 (100%) 

Query: 487 ccgcctctgctgcag 501 

I I I I I I I I I I I I I I I 
Sbjct: 3442 ccgcctctgctgcag 3428 

CDS 53..10063

The protein encoded by this gene binds to calmodulin and is expressed in the
centrosome.

>gi|12620204|gb|AF288398.1|AF288398 Homo sapiens C1orf14 mRNA, complete cds
Identities = 16/16 (100%)

Query: 489 gcctctgctgcagatg 504
           ||||||||||||||||
Sbjct: 1282 gcctctgctgcagatg 1267
CDS 69..2246

note="alternatively spliced"

>gi|9622520|gb|AF173157.1|AF173157 Homo sapiens MSTP098 (MST098) mRNA, 
complete cds 

Identities = 16/16 (100%) 

Query: 489 gcctctgctgcagatg 504 
I I I I I I I I I I I I I I I I 

Sbjct: 329 gcctctgctgcagatg 314 
CDS 239.. 460 

>gi|22046810|ref|XM_089096.2| Homo sapiens similar to coxsackievirus and
adenovirus receptor-like protein (LOC163724), mRNA
Identities = 15/15 (100%) 

Query: 490 cctctgctgcagatg 504 
I I I I I I I I I I I I I I I 

Sbjct: 578 cctctgctgcagatg 564 
CDS 291. . 1466 

misc_feature 552.. 674 

>gi | 20070194 | ref | NM_006140 . 2 | Homo sapiens colony stimulating factor 2 
receptor, alpha, low-affinity (granulocyte-macrophage) (CSF2RA) , mRNA 
Identities = 17/18 (94%) 

Query: 512 tctgcgaccagtggcacc 529 

I I I I I I I I I I I I I I I I I 
Sbjct: 248 tctgcgaacagtggcacc 265 

CDS 171 . . 1373 

>gi|15990415|gb|BC015569.1|BC015569 Homo sapiens, Similar to SRp25 nuclear
protein,
Identities = 14/14 (100%)

Query: 517 gaccagtggcaccg 530
           ||||||||||||||
Sbjct: 406 gaccagtggcaccg 419
CDS 37..684
>gi|13624213|gb|AF319045.1|AF319045 Homo sapiens contactin-associated protein
2 (CNTNAP2) mRNA, complete
Identities = 14/14 (100%) 



Query: 517 gaccagtggcaccg 530 
I I I I I I I I I I I I I I 

Sbjct: 2778 gaccagtggcaccg 2791 
CDS 141.. 4136 



>gi|14702161|ref|NM_032421.1| Homo sapiens cytoplasmic linker 2 (CYLN2),
transcript variant 2,
Identities = 14/14 (100%) 

Query: 519 ccagtggcaccgcc 532 
I I I I I I I I I I I I I I 

Sbjct: 314 ccagtggcaccgcc 327 
CDS 323..3363

note="synonyms: WSCR4, WBSCR4, CLIP-115, KIAA0291, MGC11333"
note="isoform 2 is encoded by transcript variant 2;
Williams-Beuren syndrome chromosome region 4"

>gi|22044320|ref|XM_086178.5| Homo sapiens agrin (AGRN), mRNA
Identities = 14/14 (100%)

Query: 519 ccagtggcaccgcc 532
           ||||||||||||||
Sbjct: 4325 ccagtggcaccgcc 4338
CDS 366..6107

>gi|4885506|ref|NM_005468.1| Homo sapiens N-acetylated alpha-linked acidic
dipeptidase-like; ILEAL DIPEPTIDYLPEPTIDASE (NAALADASEL), mRNA
Identities = 14/14 (100%) 



Query: 520 cagtggcaccgccg 533 

I I I I I I I I I I I I I I 
Sbjct: 1156 cagtggcaccgccg 1169 

CDS 17.. 2239 

/function="peptidase"



>gi|23943861|ref|NM_020378.1| Homo sapiens K562 cell-derived leucine-zipper-
like protein 1 (KLP1), mRNA
Identities = 15/15 (100%)

Query: 523 tggcaccgccgccgg 537
           |||||||||||||||
Sbjct: 486 tggcaccgccgccgg 500
CDS 90..710

note="K562 cells-derived leucine-zipper-like protein 1" 

>gi|20384683|gb|AY030095.1| Homo sapiens pancreasin mRNA, complete cds
Identities = 17/17 (100%) 

Query: 524 ggcaccgccgccggccg 540 

I I I I I I I I I I I I I I I I I 
Sbjct: 23 ggcaccgccgccggccg 7 

CDS 1. .873 

note="CAPH2; channel-activating protease 2; tryptic serine protease; similar to 
marapsin" 

>gi|20552317|ref|XM_096904.4| Homo sapiens Kruppel-like factor 13 (KLF13),
Identities = 15/15 (100%)

Query: 525 gcaccgccgccggcc 539
I I i I I I I I I I I I I I I 

Sbjct: 20 gcaccgccgccggcc 34 
CDS 47.. 913 

note="synonyms: BTEB3, FKLF2, NSLP1, FKLF-2, RFLAT1, RFLAT-1" 
/product="similar to Krueppel-like factor 13 

>gi|14727714|ref|XM_045113.1| Homo sapiens astrotactin (ASTN), mRNA
Identities = 15/15 (100%) 

Query: 525 gcaccgccgccggcc 539 

I I I I I I I I I I I I I I I 
Sbjct: 72 gcaccgccgccggcc 58 

CDS 15..3899

product="similar to KIAA0289" 

>gi|22538424|ref|NM_145691.2| Homo sapiens ATP synthase mitochondrial F1
complex assembly factor 2 (ATPAF2), nuclear gene encoding mitochondrial protein,
Identities = 14/14 (100%) 

Query: 525 gcaccgccgccggc 538 
I I I I I I I I I I I I I I 

Sbjct: 947 gcaccgccgccggc 960 
CDS 154.. 1023 

>gi|22065230|ref|XM_035037.2| Homo sapiens low density lipoprotein receptor-
related protein 4 (LRP4), mRNA
Identities = 14/14 (100%) 

Query: 526 caccgccgccggcc 539 
I I I I I I I I I I I I I I 

Sbjct: 2107 caccgccgccggcc 2120 

CDS 232..4839

product="similar to MEGF7"

>gi|20545806|ref|XM_007095.6| Homo sapiens insulin receptor substrate 2
(IRS2), mRNA 

Identities = 14/14 (100%) 

Query: 526 caccgccgccggcc 539 

I I I I I I I I I I I I I I 
Sbjct: 3613 caccgccgccggcc 3626 
CDS 516.. 4532 

>gi|4506228|ref|NM_002809.1| Homo sapiens proteasome (prosome, macropain) 26S
subunit, non-ATPase, 3 (PSMD3), mRNA
Identities = 14/14 (100%)

Query: 526 caccgccgccggcc 539
           ||||||||||||||
Sbjct: 237 caccgccgccggcc 250
CDS 158..1762

>gi|1657753|gb|U63721.1|HSU63721 Human elastin (ELN) gene, partial cds, and 
LIM-kinase (LIMK1) gene, 
Identities = 14/14 (100%) 

Query: 526 caccgccgccggcc 539 

I I I I I I I I I I I I I I 
Sbjct: 23610 caccgccgccggcc 23597 
exon 23511..23730
/gene="LIMK1"



>gi|5823551|gb|AF180522.1|AF180522 Homo sapiens intersectin short form 2
(ITSN) mRNA, partial cds 
Identities = 14/14 (100%) 

Query: 526 caccgccgccggcc 539 
I I I I I I I I I I I I I I 

Sbjct: 158 caccgccgccggcc 171 
CDS <1..566

>gi|181652|gb|M85247.1|HUMDOPAM H. sapiens dopamine D1A receptor gene, complete
exon 1, and exon 2, 5' end
Identities = 14/14 (100%)

Query: 531 ccgccggccgttct 544
           ||||||||||||||
Sbjct: 1097 ccgccggccgttct 1110
misc_signal 842..1231

function="negative transcriptional modulator"

>gi|23395757|tpg|BK000395.1| TPA: Homo sapiens aflatoxin B1-aldehyde reductase
(AKR7A2) mRNA,
Identities = 14/14 (100%)

Query: 544 tcgccaccgcccag 557
           ||||||||||||||
Sbjct: 312 tcgccaccgcccag 299
CDS 23..1102

/note="aldo-keto reductase; serves as a gamma-hydroxybutyrate synthase; the 
full-length protein is predicted to contain 29 additional amino acids at the 
N-terminus that have not been recognized previously" 

>gi|7706102|ref|NM_016568.1| Homo sapiens G-protein coupled receptor SALPR;
somatostatin and angiotensin-like peptide receptor (SALPR), mRNA
Identities = 14/14 (100%) 

Query: 547 ccaccgcccagaag 560 
I I I I I I I I I I I I I I 

Sbjct: 781 ccaccgcccagaag 768 

CDS 361.. 1770 

misc_feature 697..1527

note="7tm_1; Region: 7 transmembrane receptor (rhodopsin
family)"

>gi|18568985|ref|XM_095373.1| Homo sapiens similar to Neutrophil defensin 4
precursor (HNP-4) (HP4) (Defensin, alpha 4) (LOC157295), mRNA 
Identities = 14/14 (100%) 

Query: 552 gcccagaagcagcc 565 

I I I I I I I I I I I I I I 
Sbjct: 181 gcccagaagcagcc 194 

CDS 1..375 
misc_feature 145.. 243 

>gi|22053899|ref|XM_092083.2| Homo sapiens similar to golgi autoantigen,
golgin subfamily a, 2; golgin-95 (LOC163220), mRNA
Identities = 14/14 (100%)

Query: 552 gcccagaagcagcc 565
           ||||||||||||||
Sbjct: 505 gcccagaagcagcc 518
CDS 1..1452



>gi|22053411|ref|XM_050604.4| Homo sapiens AIE-75 binding protein protein
(MCC2), mRNA 

Identities = 14/14 (100%) 

Query: 553 cccagaagcagccc 566 
I I I I I I II I I I I I I 

Sbjct: 407 cccagaagcagccc 420 
CDS 114..2225



>gi|4505326|ref|NM_000263.1| Homo sapiens N-acetylglucosaminidase, alpha-
(Sanfilippo disease IIIB) (NAGLU), mRNA
Identities = 17/18 (94%)

Query: 552 gcccagaagcagcccgcc 569
           |||||||||| |||||||
Sbjct: 1609 gcccagaagctgcccgcc 1626
CDS 333..2564

/function="one of four enzymes involved in the degradation of heparan sulfate;
specifically removes the alpha-N-acetylglucosamine residues"

>gi|15718672|ref|NM_000683.2| Homo sapiens adrenergic, alpha-2C-, receptor
(ADRA2C), mRNA
Identities = 14/14 (100%) 



Query: 553 cccagaagcagccc 566 
I I I I I I I I I I I I I I 

Sbjct: 2530 cccagaagcagccc 2517 

CDS 892..2277
/note="alpha2-AR-C4"
/product="alpha-2C-adrenergic receptor"



>gi|15812179|ref|NM_004926.2| Homo sapiens zinc finger protein 36, C3H type-
like 1 (ZFP36L1), mRNA
Identities = 15/15 (100%)

Query: 553 cccagaagcagcccg 567
           |||||||||||||||
Sbjct: 432 cccagaagcagcccg 446
CDS 131..1147

/note="EGF-response factor 1; early response factor Berg36; zinc finger protein, 
C3H type, 36-like 1" 

/product="butyrate response factor 1" 

>gi|20556994|ref|XM_039877.5| Homo sapiens mucin 5, subtype B,
tracheobronchial (MUC5B), mRNA
Identities = 17/18 (94%)

Query: 555 cagaagcagcccgccgcc 572
           |||||||||||| |||||
Sbjct: 1580 cagaagcagccctccgcc 1563
CDS 46..2688



/db_xref="MIM:600770"

>gi|22060317|ref|XM_114498.2| Homo sapiens similar to Tcte-1 peptide
(LOC202500), mRNA
Identities = 14/14 (100%)

Query: 556 agaagcagcccgcc 569
           ||||||||||||||
Sbjct: 1317 agaagcagcccgcc 1330
CDS 346..1401

>gi|4505032|ref|NM_000752.1| Homo sapiens leukotriene B4 receptor (LTB4R),
Identities = 19/20 (95%)
/note="Chemokine receptor-like 1

Query: 559 agcagcccgccgccggcgca 578
           ||||| ||||||||||||||
Sbjct: 2595 agcaggccgccgccggcgca 2576
CDS 1718..2776



>gi|1648869|emb|X98356.1|HSGPCRCO H. sapiens mRNA for G protein-coupled
receptor
Identities = 19/20 (95%)

Query: 559 agcagcccgccgccggcgca 578
           ||||| ||||||||||||||
Sbjct: 1266 agcaggccgccgccggcgca 1247
CDS 389..1447

>gi|5032110|ref|NM_005634.1| Homo sapiens SRY (sex determining region Y)-box 3
(SOX3), mRNA

Identities = 15/15 (100%) 

Query: 559 agcagcccgccgccg 573 

I I I I I I I I I I I I I I I 
Sbjct: 862 agcagcccgccgccg 876 
CDS 1. . 1332 

>gi|20548635|ref|XM_167923.1| Homo sapiens homeobox protein Gsh-1 (Gsh-1),
Identities = 14/14 (100%)

Query: 560 gcagcccgccgccg 573
           ||||||||||||||
Sbjct: 116 gcagcccgccgccg 129
CDS 49..S43

>gi|8923792|ref|NM_017514.1| Homo sapiens likely ortholog of mouse plexin 3
(PLXN3), mRNA
Identities = 14/14 (100%)

Query: 561 cagcccgccgccgg 574
           ||||||||||||||
Sbjct: 2896 cagcccgccgccgg 2909
CDS 185..5800

>gi|9247121|gb|AF284224.1|AF284224 Homo sapiens DMRT2 and terra-like protein
(DMRT2) bicistronic mRNA,
Identities = 16/16 (100%)

Query: 561 cagcccgccgccggcg 576
           ||||||||||||||||
Sbjct: 111 cagcccgccgccggcg 126
CDS 1..681
/note="putative transcription factor"

>gi|6179565|emb|Y19052.1|HOSA19052 Homo sapiens mRNA for doublesex-like 2
protein (DSXL-2 gene)
Identities = 16/16 (100%)
Strand = Plus / Plus



Query: 561 cagcccgccgccggcg 576 
I I I I I I I I I I I I I I I I 

Sbjct: 565 cagcccgccgccggcg 580 
CDS 455.. 1135 
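
Most hits in this listing omit the `Strand` line, but the orientation can be read off the coordinates: subject positions that increase alongside the query indicate a plus/plus match, while decreasing subject positions indicate that the match is to the opposite strand. A minimal sketch of that rule follows; `hit_strand` is a hypothetical helper name and the output format simply mirrors the `Strand = Plus / Plus` line above.

```python
def hit_strand(q_start: int, q_end: int, s_start: int, s_end: int) -> str:
    """Infer the strand label from query and subject coordinate order."""
    # Ascending coordinates are read as the plus strand, descending as minus.
    query = "Plus" if q_end >= q_start else "Minus"
    subject = "Plus" if s_end >= s_start else "Minus"
    return f"{query} / {subject}"


# Query 561..576 against Sbjct 565..580 (the doublesex-like 2 hit above)
print("Strand =", hit_strand(561, 576, 565, 580))
# Query 302..315 against Sbjct 117..104 (subject coordinates descend)
print("Strand =", hit_strand(302, 315, 117, 104))
```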



>gi|14165271|ref|NM_032409.1| Homo sapiens PTEN induced putative kinase 1
(PINK1), mRNA
Identities = 14/14 (100%)

Query: 561 cagcccgccgccgg 574
           ||||||||||||||
Sbjct: 213 cagcccgccgccgg 200
CDS 95..1840
/note="protein kinase BRPK"

>gi|21536278|ref|NM_006371.2| Homo sapiens cartilage associated protein
(CRTAP), mRNA
Identities = 14/14 (100%)

Query: 562 agcccgccgccggc 575
           ||||||||||||||
Sbjct: 295 agcccgccgccggc 308
CDS 12..1217
The human cartilage associated protein is homologous to the chick and mouse 
CRTAP genes. 

>gi|4504646|ref|NM_001560.1| Homo sapiens interleukin 13 receptor, alpha 1
(IL13RA1), mRNA 
Identities = 15/15 (100%) 



Query: 564 cccgccgccggcgca 578 
I I I I I I I I I I I I I I I 

Sbjct: 103 cccgccgccggcgca 89 
CDS 44.. 1327 



>gi|4507206|ref|NM_003130.1| Homo sapiens sorcin (SRI), mRNA
Identities = 14/14 (100%) 

Query: 564 cccgccgccggcgc 577 
I I I 1 II I I I I I I I I 

Sbjct: 48 cccgccgccggcgc 35 
CDS 13.. 609 



>gi|21735549|ref|NM_002446.2| Homo sapiens mitogen-activated protein kinase
kinase kinase 10 (MAP3K10), mRNA
Identities = 15/15 (100%)

Query: 576 cagcccgccgccggc 590
           |||||||||||||||
Sbjct: 2885 cagcccgccgccggc 2899
CDS 289..3153

/note="mixed lineage kinase 2; MKN28 kinase; MKN28 derived
nonreceptor_type serine/threonine kinase"

>gi|18548972|ref|XM_089318.1| Homo sapiens similar to ATP-dependent DNA
helicase MER3 (HFM1 protein) (LOC164045), mRNA
Identities = 14/14 (100%) 



Query: 580 ttcttcccgccgcc 593 
I I I I I I I I I I i I I I 

Sbjct: 67 ttcttcccgccgcc 80 
CDS 1..2196 



>gi|13929461|ref|NM_001497.2| Homo sapiens UDP-Gal:betaGlcNAc beta 1,4-
galactosyltransferase, polypeptide 1 (B4GALT1), mRNA
Identities = 14/14 (100%)

Query: 581 tcttcccgccgccg 594
           ||||||||||||||
Sbjct: 73 tcttcccgccgccg 60
CDS 73..1269



>gi|21359847|ref|NM_001194.2| Homo sapiens hyperpolarization activated cyclic
nucleotide-gated potassium channel 2 (HCN2), mRNA
Identities = 14/14 (100%) 

Query: 581 tcttcccgccgccg 594 

I I I I I I I I I I I I I I 
Sbjct: 2191 tcttcccgccgccg 2204 

CDS 54.. 2723 

function="pacemaker channel of human heart" 

note="cyclic nucleotide-gated; brain cyclic nucleotide gated channel 2" 

>gi I 18032269 | gb I AF274003 . 1 | AF274003 Homo sapiens splicing-related factor- RNPS1 
(RNPS1) mRNA, complete cds, alternatively spliced 
Identities = 14/14 (100%) 

Query: 581 tcttcccgccgccg 594 
I I I I I I I I I I I I I I 

Sbjct: 46 tcttcccgccgccg 33 
CDS 46.. 894 

>gi|4504382|ref|NM_001528.1| Homo sapiens HGF activator (HGFAC), mRNA
Identities = 15/15 (100%) 

Query: 590 cgccgttcttctcgc 604 
I I I I I I I I I I I I I I I 

Sbjct: 1828 cgccgttcttctcgc 1814 
CDS 4. .1971 

>gi|14196470|ref|NM_032054.1| Homo sapiens protocadherin gamma subfamily A, 5
(PCDHGA5), transcript variant 2, mRNA 
Identities = 14/14 (100%) 

Query: 600 ctcgccgttcttct 613 

I I I I I I I I I I I I I I 
Sbjct: 2416 ctcgccgttcttct 2403 

CDS 1..2442 
cadherin ME3"
/protein_id="NP_114443.1"

These gene clusters have an immunoglobulin-like organization, suggesting that a 
novel mechanism may be involved in their regulation and expression. 

>gi|19923446|ref|NM_015963.2| Homo sapiens CGI-36 protein (CGI-36), mRNA
Identities = 14/14 (100%) 

Query: 600 ctcgccgttcttct 613 

I I I I I I I I I I I I I I 
Sbjct: 947 ctcgccgttcttct 934 
CDS 807..1565



>gi|21314399|gb|AF508782.1| Homo sapiens fibroblast growth factor 10 (FGF10) 
mRNA, partial cds 
Identities = 14/14 (100%) 

Query: 604 ccgttcttctcaat 617 
I I I I I I I I I I I I I I 

Sbjct: 173 ccgttcttctcaat 160 
CDS <1..513

/function="paracrine growth factor for epithelia"
/note="keratinocyte growth factor-2; KGF2; produced by 
fibroblasts of urinary bladder lamina propria" 

>gi|22058975|ref|XM_172182.1| Homo sapiens similar to Ribosomal protein S5;
Minute (1) 15D; Minute; transcript e (LOC255793), mRNA
Identities = 14/14 (100%) 

Query: 606 gttcttctcaatgg 619 
I I I I I I I I I I I I I I 

Sbjct: 405 gttcttctcaatgg 392 
CDS 1..438 

>gi|24797096|ref|NM_006907.2| Homo sapiens pyrroline-5-carboxylate reductase 1
(PYCR1), nuclear gene encoding mitochondrial protein, transcript variant 1, mRNA

Identities = 14/14 (100%) 

Query: 607 ttcttctcaatgga 620 

I I I I I I I I I II 1 I I 
Sbjct: 598 ttcttctcaatgga 585 

CDS 279.. 1238 

/note="isoform 1 is encoded by transcript variant 1; P5C
reductase" 

>gi|4186165|gb|L41919.1|HUMHIC1G Homo sapiens candidate tumor suppressor HIC-1
(HIC-1) gene, complete
Identities = 20/21 (95%) 

Query: 626 cgccccggccgcgccgcccgc 646
I I I I I I I I I I I I I I I I I I I I 

Sbjct: 2451 cgccccggccgcgccggccgc 2431 CDS: 637.. 2781 

Isolation and embryonic expression of the novel mouse gene Hic1, the homologue of
HIC1, a candidate gene for the Miller-Dieker syndrome 

>gi|23238250|ref|NM_032862.2| Homo sapiens tigger transposable element derived
5 (TIGD5) mRNA,
Identities = 18/19 (94%) 

Query: 626 cgccccggccgcgccgccc 644 
I I I I I I I I I I I I I I I I I I 

Sbjct: 510 cgccccggccccgccgccc 528 
CDS 1..1782 

>gi|17939629|gb|BC019288.1|BC019288 Homo sapiens, hepatocyte nuclear factor 3, 
beta, clone 

Identities = 18/19 (94%) 

Query: 626 cgccccggccgcgccgccc 644 
I II I I II I I II I I I I I I I 

Sbjct: 284 cgccccggccgagccgccc 266 
CDS <1..1370
/product="hepatocyte nuclear factor 3, beta" 



>gi|22041040|ref|XM_092478.2| Homo sapiens similar to Gliacolin (LOC165257),
mRNA

Identities = 21/23 (91%) 

Query: 628 ccccggccgcgccgcccgccacc 650 

I I I I I I I I I I I I I I I I I I I I I 
Sbjct: 810 ccccggccccgccgcccaccacc 788 

CDS 471.. 1262 

/product="similar to Gliacolin" 

>gi|17974541|gb|AF361354.1|AF361354 Homo sapiens voltage-dependent calcium
channel gamma-8 subunit (CACNG8) mRNA, complete cds
Identities = 16/16 (100%) 

Query: 628 ccccggccgcgccgcc 643 
I I I I I I I I I I I I I I I I 

Sbjct: 1132 ccccggccgcgccgcc 1117 
CDS 106.. 1386 

>gi|24475868|ref|NM_153836.1| Homo sapiens cellular repressor of
E1A-stimulated genes 2 (CREG2),
Identities = 19/20 (95%) 

Query: 628 ccccggccgcgccgcccgcc 647 
I I I I I I I I I I I I I I I II I I 

Sbjct: 177 ccccggccgcgccggccgcc 158 
CDS 139.. 1011 

>gi|21237772|ref|NM_016431.2| Homo sapiens mitogen-activated protein kinase 8
interacting protein 2 (MAPK8IP2), transcript variant 2, mRNA
Identities = 19/19 (100%) 

Query: 629 cccggccgcgccgcccgcc 647
I I I I I I I I I I I I I I I I I I I 

Sbjct: 1276 cccggccgcgccgcccgcc 1294 
CDS 1..2394 

/note="PRKM8 interacting protein-like; JNK-interacting protein 2; islet-brain 2; 
JNK MAP kinase scaffold protein JIP2; homologous to mouse JIP-1" 

>gi|14971412|ref|NM_015906.2| Homo sapiens tripartite motif-containing 33
(TRIM33), transcript variant alpha, mRNA
Identities = 16/16 (100%) 

Query: 629 cccggccgcgccgccc 644 
I I I I I I I I I I I I I I I I 

Sbjct: 267 cccggccgcgccgccc 252 
CDS 85.. 3468 

>gi|13183792|gb|AF336133.1|AF336133 Homo sapiens CECR2 protein (CECR2) mRNA, 
complete cds 

Identities = 16/16 (100%) 

Query: 629 cccggccgcgccgccc 644 

I I I I I I I I I I I I I I I I 
Sbjct: 448 cccggccgcgccgccc 433 

CDS 419..4873

>gi|22046663|ref|XM_047536.6| Homo sapiens obscurin, cytoskeletal calmodulin
and titin-interacting RhoGEF (OBSCN), mRNA
Identities = 15/15 (100%)



Query: 631 cggccgcgccgcccg 645 
I I I I I I I I I I I I I I I 

Sbjct: 572 cggccgcgccgcccg 558 

CDS 45.. 18518 

/product="similar to obscurin"



>gi|21707308|gb|BC033826.1| Homo sapiens, purinergic receptor P2X,
ligand-gated ion channel, 4, clone MGC:45331 IMAGE:5216449, mRNA, complete cds
Identities = 15/15 (100%) 



Query: 632 ggccgcgccgcccgc 646

I I I I I I I I I I I I I I I 
Sbjct: 24 ggccgcgccgcccgc 10 

CDS 25.. 1191 

/product="purinergic receptor P2X, ligand-gated ion channel, 4" 

>gi|21450823|ref|NM_145056.1| Homo sapiens thymus expressed gene 3-like
(MGC15476), mRNA
Identities = 16/16 (100%) 



Query: 632 ggccgcgccgcccgcc 647 
I I I I I I I I I I I I I I I I 

Sbjct: 879 ggccgcgccgcccgcc 894 
CDS 441. . 1655 



>gi|22652729|gb|AF489858.1| Homo sapiens serum amyloid A activating factor 2
mRNA, complete cds 
Identities = 16/16 (100%) 



Query: 632 ggccgcgccgcccgcc 647 
I I II I I I I I I I I I I I I 

Sbjct: 550 ggccgcgccgcccgcc 565 

CDS 113..1594

/note="transcription factor SAF-2" 



>gi|20558544|ref|XM_028783.2| Homo sapiens opioid growth factor receptor
(OGFR), mRNA

Identities = 15/15 (100%) 



Query: 632 ggccgcgccgcccgc 646 

I I I I I I I 1 I I I 1 I I I 
Sbjct: 174 ggccgcgccgcccgc 160 
CDS 17 . . 2050 

/product="similar to 7-60 protein" 



>gi|22042730|ref|XM_114346.2| Homo sapiens similar to source of immunodominant
MHC-associated peptides (LOC201595), mRNA
Identities = 15/15 (100%) 

Query: 633 gccgcgccgcccgcc 647 
I I I I I I I I I I I I I I I 

Sbjct: 345 gccgcgccgcccgcc 331
CDS 200. .2680 

/product="similar to source of immunodominant MHC-associated peptides"

>gi|4507162|ref|NM_003107.1| Homo sapiens SRY (sex determining region Y)-box 4
(SOX4), mRNA

Identities = 16/16 (100%) 
Query: 634 ccgcgccgcccgccac 649
I I I I I I I I I I I I I I I I 

Sbjct: 972 ccgcgccgcccgccac 957 
CDS 351. . 1775 



>gi|23395757|tpg|BK000395.1| TPA: Homo sapiens aflatoxin B1-aldehyde reductase
(AKR7A2) mRNA, 
Identities = 18/19 (94%) 



Query: 640 cgcccgccaccgccgcggc 658 
I I I I I I I I I I I I I I I I I I 

Sbjct: 180 cgcccgccagcgccgcggc 198 
CDS 23.. 1102 



>gi|5032110|ref|NM_005634.1| Homo sapiens SRY (sex determining region Y)-box 3
(SOX3), mRNA

Identities = 18/18 (100%) 



Query: 641 gcccgccaccgccgcggc 658 

I I I I I I I I I I I I I I I I I I 
Sbjct: 1044 gcccgccaccgccgcggc 1061 
CDS 1..1332 



>gi|20589957|ref|NM_139075.1| Homo sapiens two-pore calcium channel protein 2
(TPC2), mRNA

Identities = 15/15 (100%) 



Query: 643 ccgccaccgccgcgg 657
I I I I I I I I I I I I I I I 

Sbjct: 160 ccgccaccgccgcgg 146
CDS 102.. 2360 



>gi|16753218|ref|NM_033224.2| Homo sapiens purine-rich element binding protein
B (PURB), mRNA 
Identities = 15/15 (100%) 



Query: 643 ccgccaccgccgcgg 657 
I I I I I I I I I I I I I I I 

Sbjct: 480 ccgccaccgccgcgg 466 
CDS 14.. 952 



>gi|291945|gb|L12398.1|HUMD4C Homo sapiens dopamine receptor D4 (DRD4) mRNA, 
complete cds 

Identities = 15/15 (100%) 



Query: 643 ccgccaccgccgcgg 657 
I I I I I I I I I I I I I I I 

Sbjct: 502 ccgccaccgccgcgg 488 
CDS 1..1404 



>gi|19401873|gb|AF479827.1| Homo sapiens protein kinase-like protein mRNA,
complete cds

Identities = 18/19 (94%) 



Query: 644 cgccaccgccgcggctggg 662 

I I I I I I I I I I I I I I I I I I 
Sbjct: 1907 cgccaccgccggggctggg 1889 
CDS 278. .2614 



>gi|7542578|gb|AF241229.1|AF241229 Homo sapiens GITR-D mRNA, complete cds
Identities = 15/15 (100%) 
Query: 651 gccgcggctgggccc 665
I I I I I I I I I I I I I I I 

Sbjct: 682 gccgcggctgggccc 696 CDS: 1..768 

>gi|23238193|ref|NM_148901.1| Homo sapiens tumor necrosis factor receptor
superfamily, member 18 (TNFRSF18), transcript variant 2, mRNA
Identities = 15/15 (100%) 

Query: 652 gccgcggctgggccc 666 
I I I I I I I I I I I I I I I 

Sbjct: 820 gccgcggctgggccc 834 CDS: 139.. 906 

>gi|11038623|ref|NM_004426.1| Homo sapiens polyhomeotic-like 1 (Drosophila)
(PHC1), mRNA 

Identities = 15/15 (100%) 

Query: 671 tcttcacccttgtct 685 

I I I I I I I I I I I I I I I 
Sbjct: 1644 tcttcacccttgtct 1658 

CDS 210. . 3224 

/note="early development regulator 1; mouse Rae28-like" 
>gi|190395|gb|M60494.1|HUMPROFILA Human profilaggrin gene, 3' end
Identities = 15/15 (100%) 

Query: 678 ccttgtcttcgtcca 692 
I I I I I I I I I I I II I I 

Sbjct: 4236 ccttgtcttcgtcca 4222 
exon 949. . 4447 

CDS 1478. .4447 

>gi|24475953|ref|NM_013433.2| Homo sapiens karyopherin beta 2b, transportin
(TRN2), mRNA

Identities = 14/14 (100%) 

Query: 678 ccttgtcttcgtcc 691 

I I I I I I I I I I I I I I 
Sbjct: 2026 ccttgtcttcgtcc 2013 

CDS 292. .2955 

/note="hypothetical protein FLJ12155" 

>gi|8923472|ref|NM_017852.1| Homo sapiens NALP2 protein (NALP2), mRNA
Identities = 15/15 (100%) 

Query: 685 ttcgtccacgtctag 699

I I I I I I I I I I I I I I I 
Sbjct: 453 ttcgtccacgtctag 439 

CDS 88.. 3276 

/note="PYRIN-Containing APAF1-like"

>gi|10198206|gb|AF298547.1| Homo sapiens nucleotide-binding site protein 1
mRNA, complete cds
Identities = 15/15 (100%) 

Query: 685 ttcgtccacgtctag 699 

I I I I I I I I I I I I I I I 
Sbjct: 425 ttcgtccacgtctag 411 

CDS 78.. 3179 

/note="NBS1; nucleotide-binding site/leucine-rich repeat
(NBS/LRR) family member" 
>gi|4504576|ref|NM_002164.1| Homo sapiens indoleamine-pyrrole 2,3 dioxygenase
(INDO), mRNA

Identities = 17/17 (100%) 

Query: 691 cacgtctagttctggga 707 
I I I I I I I I I I I I I I I I I 

Sbjct: 258 cacgtctagttctggga 274 
CDS 23.. 1234 

>Numatrin) pseudogene and the MDFI gene for MyoD family inhibitor (myogenic 
repressor I-MF 
Identities = 15/15 (100%) 

Query: 697 tagttctgggacctc 711 
I I I I I I I I I I I I I I I 

Sbjct: 64761 tagttctgggacctc 64775 

CDS complement (64067. .64810) 

/note="dJ696P19.2 (NPM1 (Nucleophosmin, Numatrin) pseudogene)

>gi|22047835|ref|XM_095174.3| Homo sapiens similar to pol protein (LOC168550),
mRNA

Identities = 14/14 (100%) 

Query: 699 gttctgggacctcc 712 
I I I I I I I I I I I I I I 

Sbjct: 1249 gttctgggacctcc 1262 
CDS 1..3315 

>gi|4506508|ref|NM_002926.1| Homo sapiens regulator of G-protein signalling 12
(RGS12), mRNA

Identities = 15/15 (100%) 

Query: 700 ttctgggacctcccg 714 
I I I I I I I I I I I I I I I 

Sbjct: 2786 ttctgggacctcccg 2772 CDS: 55.. 4185 

>gi|22771013|gb|AF542391.1| Homo sapiens selectin P (granule membrane protein
140kDa, antigen CD62) (SELP) gene, complete cds 
Identities = 17/18 (94%) 

Query: 705 ggacctcccgctcaagag 722 
I I I I I I I I I I I I I I I I I 

Sbjct: 6869 ggacctcctgctcaagag 6886
gene <1 . . >398 

>gi|4758621|ref|NM_004770.1| Homo sapiens potassium voltage-gated channel,
Shab-related subfamily, member 2 (KCNB2), mRNA
Identities = 14/14 (100%) 

Query: 711 cccgctcaagagcc 724 

I I I I I I I I I I I I I I 
Sbjct: 1329 cccgctcaagagcc 1316 

CDS 3.. 2423 

>gi|21595817|gb|BC032731.1| Homo sapiens, Wolf-Hirschhorn syndrome candidate 1,
Identities = 14/14 (100%)

Query: 713 cgctcaagagccag 726 
I I I I I I I I I I I I I I 

Sbjct: 1876 cgctcaagagccag 1889 
source 1 . . 4239 
>gi|19172410|gb|AF480461.1| Homo sapiens mixed lineage kinase-related kinase
MRK-alpha mRNA,
Identities = 14/14 (100%) 

Query: 715 ctcaagagccagtg 728 
I I I I I I I I I I I I I I 

Sbjct: 2454 ctcaagagccagtg 2467 
CDS 196.. 2598 

>gi|23272700|gb|BC035910.1| Homo sapiens, mitogen-activated protein kinase
kinase kinase 7 interacting protein 2, 
Identities = 14/14 (100%) 

Query: 727 tggtacaccagaag 740 
I I I I I I I I I I I I I I 

Sbjct: 1758 tggtacaccagaag 1771 
CDS 149.. 2230 

>gi|10947029|ref|NM_006217.2| Homo sapiens serine (or cysteine) proteinase
inhibitor, clade I (neuroserpin), member 2 (SERPINI2), mRNA
Identities = 19/20 (95%) 

Query: 737 gaagtctacttttttttcta 756
           |||||||||||||| |||||

Sbjct: 927 gaagtctactttttgttcta 908
CDS 34.. 1251 

>gi|24638453|ref|NM_170665.1| Homo sapiens ATPase, Ca++ transporting, cardiac
muscle, slow twitch 2 (ATP2A2), mRNA 
Identities = 14/14 (100%) 

Query: 765 actttgtcaccaac 778 
I I I I I I I I I I I I I I 

Sbjct: 586 actttgtcaccaac 573 
CDS 111.. 3239 

>gi|21536291|ref|NM_001081.2| Homo sapiens cubilin (intrinsic factor-cobalamin
receptor) (CUBN),
Identities = 14/14 (100%) 

Query: 765 actttgtcaccaac 778 

I I I I I I I I I I I I I I 
Sbjct: 8029 actttgtcaccaac 8042 

CDS 27 . . 10898 

/note="intrinsic factor-cobalamin receptor; intrinsic factor B12-receptor"

>gi|6912355|ref|NM_012155.1| Homo sapiens echinoderm microtubule associated
protein like 2 (EML2), 
Identities = 14/14 (100%) 

Query: 766 ctttgtcaccaact 779 
I I I I I I I I I I I I I I 

Sbjct: 1595 ctttgtcaccaact 1608 
CDS 36.. 1985 

>gi|18597004|ref|XM_051693.4| Homo sapiens mitogen inducible 2 (MIG2), mRNA
Identities = 15/15 (100%) 

Query: 768 ttgtcaccaacttct 782 
I I I I I I I I I I I I I I I 

Sbjct: 838 ttgtcaccaacttct 852 
CDS: 238..2280



>gi|3335149|gb|AF055377.1|AF055377 Homo sapiens long form transcription factor
C-MAF (c-maf) mRNA, 
Identities = 15/15 (100%) 



Query: 771 tcaccaacttctcgt 785 

I I I I I I I I I I I I I I I 
Sbjct: 1864 tcaccaacttctcgt 1850 
CDS 808.. 2019 

/note="b-zip transcription factor" 



>gi|7304920|ref|NM_013449.1| Homo sapiens bromodomain adjacent to zinc finger
domain, 2A (BAZ2A),
Identities = 14/14 (100%) 



Query: 800 tgagtggaggacta 813 
I I I I I I I I I I I I I I 

Sbjct: 1587 tgagtggaggacta 1600 
CDS 740.. 6376 



>gi|5419653|emb|AL034553.12|HS914P20 Human DNA sequence from clone RP5-914P20 
on chromosome 20q13.13-13.2 Contains the gene for activity-dependent
neuroprotective protein (ADNP, KIAA0784), a PSMD10 (proteasome (prosome,
macropain) 26S subunit, non-ATPase, 10) pseudogene, the DPM1 gene fo>
Identities = 19/20 (95%) 



Query: 803 gtggaggactaataagactt 822 
I I I I I I I I I I I I I I I I I I I 

Sbjct: 10723 gtggaggactaatgagactt 10704 misc_feature: 10509..10953

>gi|14388625|gb|AF243083.1|F243081S03 Homo sapiens intrinsic factor-vitamin
B12 receptor (CUBN) gene, exons 5 and 6 
Identities = 16/16 (100%) 

Query: 817 gacttatatactgtcc 832 

I I I I I I I I I I I I I I I I 
Sbjct: 809 gacttatatactgtcc 824 
CDS AF243085.1:692..854



>gi|24527258|gb|AY071904.1| Homo sapiens ribonuclease/angiogenin inhibitor
(RNH) mRNA, complete 
Identities = 17/18 (94%) 



Query: 824 atactgtccgttctttga 841
I I I I I I I I I I I I I I I I I 

Sbjct: 532 atactgtcagttctttga 515
CDS 1. .1386 



>gi|9558724|ref|NM_013291.1| Homo sapiens cleavage and polyadenylation
specific factor 1, 160kDa (CPSF1), mRNA
Identities = 14/14 (100%) 



Query: 834 ttctttgagggagg 847 

I I I I I I I I I I I I I I 
Sbjct: 1313 ttctttgagggagg 1300 
CDS 52.. 4380 



>gi|20143921|ref|NM_133437.1| Homo sapiens titin (TTN), transcript variant
novex-2, mRNA
Identities = 14/14 (100%)

Query: 838 ttgagggaggacct 851
I I I I I I I I I I I I I I 

Sbjct: 56027 ttgagggaggacct 56014 

CDS 224.. 81580 

/note="isoform novex-2 is encoded by transcript variant
novex-2; connectin; CMH9, included" 



Query: 839 tgagggaggacctc 852 

I I I I I I I I I I I I I I 
Sbjct: 819 tgagggaggacctc 832 

CDS 100..1155

/product="similar to ARP2/3 complex 41 kDa subunit (P41-ARC) (Actin-related
protein 2/3 complex subunit 1B)"

>gi|21426828|ref|NM_144773.1| Homo sapiens G protein-coupled receptor 73-like
1 (GPR73L1), mRNA

Identities = 14/14 (100%) 



Query: 848 gacctccctatgga 861 
I I I I I I I I I I I I I I 

Sbjct: 100 gacctccctatgga 113 CDS: 1..1155 



>gi|21327026|gb|AF506288.1| Homo sapiens prokineticin receptor 2 (PKR2) mRNA,
complete cds 

Identities = 14/14 (100%) 

Query: 848 gacctccctatgga 861 
I I I I I I I I I I I I I I 

Sbjct: 100 gacctccctatgga 113 CDS 1..1155 

>gi|7669541|ref|NM_013992.1| Homo sapiens paired box gene 8 (PAX8), transcript
variant PAX8E, 
Identities = 14/14 (100%) 

Query: 849 acctccctatggac 862 
I I I I I I I I I I I I I I 

Sbjct: 582 acctccctatggac 595 CDS 161.. 1024 

>gi|16160856|ref|XM_007763.5| Homo sapiens myosin VA (heavy polypeptide 12,
myoxin) (MYO5A), mRNA
Identities = 19/20 (95%) 



Query: 863 cgtaactggagagtctgggg 882 

I I I I I I I I I I I I I I I I I I I 
Sbjct: 724 cgtaagtggagagtctgggg 743

CDS 245..5812

/product="similar to Myosin Va (Myosin 5A) (Dilute myosin 
heavy chain, non-muscle) (Myosin heavy chain 12) (Myoxin)" 

>gi|4757807|ref|NM_001683.1| Homo sapiens ATPase, Ca++ transporting, plasma
membrane 2 (ATP2B2),
Identities = 15/15 (100%) 



Query: 865 taactggagagtctg 879 

I I I I I I I I I I I I I I I 
Sbjct: 1298 taactggagagtctg 1312 

CDS 577.. 4173 

/note="PMCA-2" 
>gi|16904386|ref|NM_013363.2| Homo sapiens procollagen C-endopeptidase
enhancer 2 (PCOLCE2), mRNA
Identities = 14/14 (100%)



Query: 867 actggagagtctgg 880 
I I I I I I I I I I I I I I 

Sbjct: 308 actggagagtctgg 321 
CDS 197..1444

>gi|7706548|ref|NM_016507.1| Homo sapiens CDC2-related protein kinase 7
(CRK7), mRNA

Identities = 15/15 (100%) 

Query: 868 ctggagagtctgggg 882 

I I I I I I I I I I I I I I I 
Sbjct: 2092 ctggagagtctgggg 2078 

CDS 34.. 4506 

>gi|24850118|ref|NM_170605.1| Homo sapiens PDZ domain protein (Drosophila
inaD-like) (INADL), mRNA 
Identities = 14/14 (100%) 

Query: 868 ctggagagtctggg 881 
I I I I I I I I I I I I I I 

Sbjct: 1570 ctggagagtctggg 1557 
CDS 1. .5406 

>gi|21536251|ref|NM_015678.2| Homo sapiens neurobeachin (NBEA), mRNA
Identities = 14/14 (100%) 
Strand = Plus / Plus 

Query: 868 ctggagagtctggg 879 
I I I I II I I I I I I I I 

Sbjct: 3008 ctggagagtctggg 3021 
CDS 207.. 9047 

>gi|21434742|gb|AF467288.1| Homo sapiens BCL8B protein (BCL8B) mRNA, complete
Identities = 14/14 (100%) 

Query: 868 ctggagagtctggg 881 
I I I I I I I I I I I I I I 

Sbjct: 3008 ctggagagtctggg 3021 
CDS 207.. 9047 

>gi|7662409|ref|NM_014963.1| Homo sapiens KIAA0963 protein (KIAA0963), mRNA
Identities = 15/15 (100%) 

Query: 868 ctggagagtctgggg 882 
I I I I I I I I I I I I I I I 

Sbjct: 2832 ctggagagtctgggg 2846 CDS 216.. 4316 

>gi|22041826|ref|XM_172259.1| Homo sapiens similar to 60S ribosomal protein
L21 (LOC255888), mRNA 
Identities = 16/16 (100%) 
Query: 869 tggagagtctggggtt 884 

I I I I I I I I I I I I I I I I 
Sbjct: 530 tggagagtctggggtt 545 
CDS 1. .579 

>gi|17157996|ref|NM_058167.1| Homo sapiens ubiquitin-conjugating enzyme E2, J2
(UBE2J2), mRNA 
Identities = 14/14 (100%) 
Query: 870 ggagagtctggggt 883
I I I I I I I I I I I I I I 

Sbjct: 652 ggagagtctggggt 639 
CDS 205 . . 879 

>gi|23308602|ref|NM_015460.1| Homo sapiens myosin VIIA and Rab interacting
protein (MYRIP), mRNA
Identities = 15/15 (100%) 

Query: 871 gagagtctggggttc 885 
I I I I I I I I I I I I I I I 

Sbjct: 1861 gagagtctggggttc 1847 
CDS 137.. 2716 

>gi|6912705|ref|NM_012455.1| Homo sapiens SEC7 homolog (TIC), mRNA
Identities = 14/14 (100%) 

Query: 872 agagtctggggttc 885 

I I I I I I I I I I I I I I 
Sbjct: 762 agagtctggggttc 749 

CDS 64.. 3234 

/note="ADP-ribosylation factor guanine nucleotide-exchange
factor 6"

>gi|22051239|ref|XM_048346.4| Homo sapiens insulin receptor (INSR), mRNA
Identities = 14/14 (100%)

Query: 884 tcgttgaccgtctt 897 
I I I I I I I I I I I I I I 

Sbjct: 3251 tcgttgaccgtctt 3238 
CDS 181.. 4218 

>gi|4507600|ref|NM_003807.1| Homo sapiens tumor necrosis factor (ligand)
superfamily, member 14 (TNFSF14), mRNA
Identities = 14/14 (100%) 

Query: 901 cggtcttacttcgg 914 
I I I I I I I I I I I I I I 

Sbjct: 742 cggtcttacttcgg 755 
CDS 49. . 771 

>gi|2745709|gb|U89310.1|AH005788S02 Homo sapiens nucleophosmin phosphoprotein
(NPM) gene, exon 2 
Identities = 14/14 (100%) 

Query: 908 acttcggttctttt 921 
I I I I I I I I I I I I I I 

Sbjct: 411 acttcggttctttt 424 CDS U89311.1:416..535

>gi|20521002|dbj|AB002333.2| Human mRNA for KIAA0335 gene, partial cds
Identities = 18/18 (100%) 

Query: 914 gttctttttaatttcttc 931 
I I II I I I I I I I I I I I I I I 

Sbjct: 216 gttctttttaatttcttc 233 CDS <829..5283 

>gi|19923778|ref|NM_006479.2| Homo sapiens RAD51-interacting protein (PIR51),
mRNA

Identities = 16/16 (100%)
Query: 915 ttctttttaatttctt 930
I I I I I i I I I I I I I I I I 

Sbjct: 703 ttctttttaatttctt 688 
CDS 51 . . 1058 

>gi|4505610|ref|NM_002582.1| Homo sapiens poly(A)-specific ribonuclease
(deadenylation nuclease) (PARN), mRNA
Identities = 16/16 (100%) 

Query: 915 ttctttttaatttctt 930 
I I I I I I I I I I I I I I I I 

Sbjct: 1897 ttctttttaatttctt 1882 
CDS 58.. 1977 

>gi|20538519|ref|XM_057659.6| Homo sapiens similar to RIKEN cDNA 2310005N03
(LOC116228), mRNA
Identities = 18/18 (100%) 

Query: 915 ttctttttaatttcttct 932 
I I I I I I I I I I I I I I I I I I 

Sbjct: 629 ttctttttaatttcttct 612 CDS 337 .. 693 

>gi|18579348|ref|XM_090294.1| Homo sapiens similar to
10-formyltetrahydrofolate dehydrogenase (LOC160428), mRNA
Identities = 19/19 (100%) 

Query: 916 tctttttaatttcttctac 934 
I I I I i I I I I I I I I I I I I I 1 

Sbjct: 2263 tctttttaatttcttctac 2245 CDS 1..2928 

>gi|4503510|ref|NM_003758.1| Homo sapiens eukaryotic translation initiation
factor 3, subunit 1 alpha, 35kDa (EIF3S1), mRNA 
Identities = 17/17 (100%) 

Query: 916 tctttttaatttcttct 932 
I I 1 I I I I I I I I I I I I I I 

Sbjct: 352 tctttttaatttcttct 336 CDS 61.. 837 

>gi|23271901|gb|BC036021.1| Homo sapiens, Similar to Bmp2-inducible kinase,
Identities = 20/21 (95%) 

Query: 917 ctttttaatttcttctactac 937
I I I I I I I I I I I I I I I I I I I I 

Sbjct: 796 ctttttaatttcttcttctac 776
CDS 128.. 2116 

>gi|18375633|ref|NM_004639.2| Homo sapiens HLA-B associated transcript 3
(BAT3), transcript variant 1, mRNA 
Identities = 15/15 (100%) 

Query: 931 ctactacgaggttct 945 
I I I I I I I I I I I I I I I 

Sbjct: 1172 ctactacgaggttct 1186 CDS 285.. 3683 

>gi|17485359|ref|XM_066371.1| Homo sapiens LOC129184 (LOC129184), mRNA
Identities = 16/16 (100%) 

Query: 1016 tgtgggtgccagggtc 1031 
I I I I I I I I I I I I I I I I 

Sbjct: 866 tgtgggtgccagggtc 851 CDS 1..960 
>gi|12653994|gb|BC000795.1|BC000795 Homo sapiens, hypothetical protein.
Identities = 15/15 (100%)



Query: 1017 gtgggtgccagggtc 1031 
I I I I I I I I I I I I I I I 

Sbjct: 306 gtgggtgccagggtc 292 CDS 49.. 1260 

>gi|1668741|emb|X90762.1|HSHHA5GEN Homo sapiens hHa5 gene 
Identities = 16/16 (100%) 

Query: 1018 tgggtgccagggtctc 1033 
I I I I I I I I I I I I I I I I 

Sbjct: 1215 tgggtgccagggtctc 1230 exon 1112.. 1365 number=8 

>gi|18582587|ref|XM_090689.1| Homo sapiens similar to S antigen precursor -
malaria parasite (Plasmodium falciparum) (strain Wellcome) (LOC161088),
mRNA

Identities = 19/20 (95%) 

Query: 1019 gggtgccagggtctcaggtg 1038 
I I I I I I I I I I I I I I I I I I I 

Sbjct: 372 gggtgccagcgtctcaggtg 353 CDS 1 . . 1221 Chr. 13 

>gi|9863549|emb|AL157718.10| Human DNA sequence from clone RP11-23013 on
chromosome 20 Contains a putative novel gene, a CpG island, ESTs and GSSs, 
complete sequence [Homo sapiens] 
Identities = 19/20 (95%) 

Query: 1021 gtgccagggtctcaggtgca 1040 
I I I I I I I I I I I I I I I I I I I 

Sbjct: 40364 gtgccagggtctcagctgca 40345 

misc_feature complement(40329..40883) /note="match: GSS: Em:AQ606491"

>gi|35258|emb|X13345.1|HSPAI19 Human gene for plasminogen activator inhibitor
1 (PAI-1) exon 9
Identities = 19/20 (95%)

Query: 1024 ccagggtctcaggtgcagac 1043 
I I I I I I I I I I I I I I I I 1 I I 

Sbjct: 264 ccagggtctcaggtggagac 245

precursor_RNA <1..1835 /note="primary transcript" 

>gi|18581862|ref|XM_090590.1| Homo sapiens LOC160925 (LOC160925), mRNA
Identities = 15/15 (100%) 

Query: 1064 agcaactctcgagtt 1078 
I I I I I I I I I I I I I I I 

Sbjct: 374 agcaactctcgagtt 360 CDS 1..870 

>gi|24660383|gb|BC039025.1| Homo sapiens, Similar to tyrosine
3-monooxygenase/tryptophan 5-monooxygenase activation protein, epsilon
polypeptide, clone MGC:47805 IMAGE:6070099, mRNA,
Identities = 17/17 (100%) 



Query: 1077 tttcgattttgctgtgg 1093 
I I I I I I 1 I I I I I I 1 I I I 

Sbjct: 58 tttcgattttgctgtgg 74 
CDS 107.. 883 



>gi|18253109|dbj|AB065437.1| Homo sapiens C1s gene for complement C1s,
promoter region and exon 1 
Identities = 15/15 (100%) 
Query: 1112 ttttcgggaaagtca 1124 (exon change)

I I I I I I I I I I I I I I I 
Sbjct: 2584 ttttcgggaaagtca 2570 

promoter 1 . . 2826 

/function="complement activation"
exon 2827.. 2978 



>gi | 22047240 | ref | XM_175003 . 1 | Homo sapiens LOC256626 (LOC256626) , mRNA 
Identities = 15/15 (100%) 

Query: 1785 gaaaagtgacctgaa 1799 
I I I I I I I I I I I I I I I 

Sbjct: 1768 gaaaagtgacctgaa 1754 CDS 1..1914 

>gi|11990557|gb|AF170052.1|AF170052 HIV-2 isolate 97227 from France envelope
glycoprotein (env) gene, partial cds 
Identities = 15/15 (100%) 

Query: 1825 gtatggcctctgtcc 1839 
I I I I I I I I I I I I I I I 

Sbjct: 866 gtatggcctctgtcc 852 CDS <1..>2243 

>gi|4502528|ref|NM_000721.1| Homo sapiens calcium channel, voltage-dependent,
alpha 1E subunit (CACNA1E), mRNA, note="brain specific"
Identities = 15/15 (100%) 

Query: 1827 atggcctctgtccgg 1841 
I I I I I I I I I I I I I I I 

Sbjct: 1433 atggcctctgtccgg 1419 CDS 166.. 6921 

>gi|20559017|ref|XM_166786.1| Homo sapiens similar to SUMO-1 activating enzyme
subunit 1; SUMO-1 activating enzyme E1 N subunit; sentrin/SUMO-activating
protein AOS1; ubiquitin-like protein SUMO-1 activating enzyme (LOC220311), mRNA
Identities = 16/16 (100%) 



Query: 1842 gatacacacggggaag 1857 

I I I I I I I I I I I I I I I I 
Sbjct: 886 gatacacacggggaag 901 CDS 1..1134 

/gene="LOC220311"

>gi|4758617|ref|NM_004693.1| Homo sapiens cytokeratin type II
(K6HF), mRNA

Identities = 16/16 (100%) 

Query: 1845 acacacggggaagctg 1860 
I I I I I I I I I I I I I I I I 

Sbjct: 366 acacacggggaagctg 351 CDS 19.. 1674 

>gi|17149843|ref|NM_057092.1| Homo sapiens FK506 binding protein 2, 13kDa
(FKBP2), transcript variant 2, mRNA 
Identities = 14/14 (100%) 



Query: 1847 acacggggaagctg 1860 

I I I I I I I I I I I I I I 
Sbjct: 269 acacggggaagctg 282 
CDS 103.. 531 

misc_feature 223.. 504 

/note="FKBP; Region: FKBP-type peptidyl-prolyl cis-trans isomerase"

>gi|20127426|ref|NM_002252.2| Homo sapiens potassium voltage-gated channel,
delayed-rectifier, subfamily S, member 3 (KCNS3), mRNA
Identities = 15/15 (100%)

Query: 1847 acacggggaagctgc 1861 
I I I I I I I I I I I I I I I 

Sbjct: 653 acacggggaagctgc 667 
CDS 403 . . 1878 

>gi|24981010|gb|BC039695.1| Homo sapiens, Janus kinase 2 (a protein tyrosine
kinase) , clone 
Identities = 14/14 (100%) 

Query: 1865 cattattcttcaaa 1878 
I I I I I I I I I I I I I I 

Sbjct: 3354 cattattcttcaaa 3341 
CDS 108.. 3497 

>gi|20149552|ref|NM_004414.3| Homo sapiens Down syndrome critical region gene
1 (DSCR1), mRNA 
Identities = 15/15 (100%) 

Query: 1872 cttcaaacgagtcag 1886 
I I I I I I I I I I I I I I I 

Sbjct: 215 cttcaaacgagtcag 229 
CDS 66.. 659 

>gi|6648540|gb|U53821.1|HSU53821 Homo sapiens adapt78 protein gene, partial
Identities = 15/15 (100%) 

Query: 1872 cttcaaacgagtcag 1886 
I I I I I I I I I I I I I I I 

Sbjct: 219 cttcaaacgagtcag 233 
CDS 70..>562 

>gi|8922685|ref|NM_018228.1| Homo sapiens hypothetical protein FLJ10811
(FLJ10811), mRNA 
Identities = 17/17 (100%) 

Query: 1955 ctggaagagctggggcc 1971 
I I I I I I I I I I I I I I I I I 

Sbjct: 923 ctggaagagctggggcc 907 CDS 146.. 2254 

>gi|17298301|gb|AF283402.1|F283327S74 Homo sapiens candidate tumor suppressor
protein (LRP1B) gene, exon 76 
Identities = 17/17 (100%) 

Query: 2034 ttcttaaaatttttact 2050 

I I I I I I I I I I I I I I I I I 
Sbjct: 307 ttcttaaaatttttact 291 CDS
join(AF283376.1:<285..407,AF283377

>gi|20545629|ref |XM_121159.1| Homo sapiens LOC206321 (LOC206321), mRNA 
Identities = 25/28 (89%) 

Query: 2133 agatagaacgagacattagagcaaagtt 2161 
I I I I I I I I I I I I I I I I I I I I I I I I I 

Sbjct: 527 agatagaacgagatctgagagcaaagtt 500 CDS 1..1176 

>gi|22054646|ref|XM_069110.2| Homo sapiens similar to hypothetical protein
FLJ23231 (LOC134973), mRNA 
Identities = 16/16 (100%) 






Query: 2155 caaagtttttgttcca 2170 
I I I I I I I I I I I I I I I I 

Sbjct: 915 caaagtttttgttcca 930 CDS 1..2475



>gi|6382057|ref|NM_007313.1| Homo sapiens v-abl Abelson murine leukemia viral
oncogene homolog 1 (ABL1), transcript variant b, mRNA
Identities = 18/18 (100%) 

Query: 2163 ttgttccacaaaaacatt 2180 
I I I I I I I I I I I II I I I I I 

Sbjct: 135 ttgttccacaaaaacatt 118 CDS 1..3447 

>gi|3095103|gb|AF044579.1|AF044579 Homo sapiens translocation related
non-coding gene (TNRG10) mRNA, complete sequence
Identities = 17/17 (100%) 



Query: 2164 tgttccacaaaaacatt 2180 

I I I I I I I I I I I I I I I I I 
Sbjct: 2361 tgttccacaaaaacatt 2345 
gene 1 . . 2726 

repeat_region 326.. 547 
repeat_region 2599 . .2709 



Intron (BB/SHR) 

>gi|5453963|ref|NM_006251.1| Homo sapiens protein kinase, AMP-activated, alpha
1 catalytic subunit (PRKAA1), mRNA 
Identities = 16/16 (100%) 



Query: 316 tcttctgagcactcaa 331 

I I I I I I I I I I I I I I I I 
Sbjct: 943 tcttctgagcactcaa 928

CDS 24..1676

note="AMPK alpha 1; Protein kinase, AMP-activated, catalytic, alpha-1" 

>gi|23503326|ref|NM_018682.2| Homo sapiens myeloid/lymphoid or mixed-lineage
leukemia 5 (trithorax homolog, Drosophila) (MLL5), mRNA
Identities = 16/16 (100%) 



Query: 365 catcagatgaaggatc 380 

I I I I I I I I I I I I I I I I 
Sbjct: 1003 catcagatgaaggatc 988 

CDS 202.. 5778 

/note="contains PHD and SET domains; similar to Drosophila trithorax" 
/product="myeloid/lymphoid or mixed-lineage leukemia 5 

>gi|1418773|emb|X97186.1|HSE14 H. sapiens mRNA for E14 protein
Identities = 17/17 (100%)

Query: 428  gaagacaaagagttctt 444
            |||||||||||||||||
Sbjct: 1509 gaagacaaagagttctt 1493
CDS 35..4318



>gi|1381666|gb|U58852.1|HSU58852 Human NPAT mRNA, complete cds
Identities = 17/17 (100%)

Query: 428  gaagacaaagagttctt 444
            |||||||||||||||||
Sbjct: 1475 gaagacaaagagttctt 1459
CDS 1..3528
/note="predicted amino acids have three regions which share similarity to
annotated domains of transcriptional factor oct-1, nucleolus-cytoplasm shuttle
phosphoprotein and protein kinases"

>gi|22065878|ref|XM_040846.5| Homo sapiens nuclear protein,
ataxia-telangiectasia locus (NPAT),
Identities = 17/17 (100%)

Query: 428  gaagacaaagagttctt 444
            |||||||||||||||||
Sbjct: 1509 gaagacaaagagttctt 1493
CDS 35..4318
/product="similar to NPAT"



>gi|18079322|ref|NM_080612.1| Homo sapiens GRB2-associated binding protein 3
(GAB3), mRNA
Identities = 17/17 (100%)

Query: 578  ggggtccaagaccagag 594
            |||||||||||||||||
Sbjct: 1254 ggggtccaagaccagag 1238
CDS 33..1793
/function="differentiation signaling"



>gi|20270211|ref|NM_033396.1| Homo sapiens tankyrase 1 binding protein 1,
182kDa (TNKS1BP1), mRNA
Identities = 16/16 (100%)

Query: 583  ccaagaccagagtaaa 598
            ||||||||||||||||
Sbjct: 2725 ccaagaccagagtaaa 2740
CDS 308..5497
/product="tankyrase 1-binding protein of 182 kDa"



>gi|21626462|ref|NM_000038.2| Homo sapiens adenomatosis polyposis coli (APC),
Identities = 16/16 (100%)

Query: 616  gaccaaaaaggaactg 631
            ||||||||||||||||
Sbjct: 8232 gaccaaaaaggaactg 8247
CDS 39..8570
/product="adenomatosis polyposis coli"